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Approximations to Stochastic Service 
Systems, with an Application to a Retrial Model 


By A. A. FREDERICKS and G. A. REISNER 
(Manuscript received February 1, 1978) 


This paper illustrates the usefulness of state-dependent, birth-death 
processes in reducing the dimensions of stochastic service systems. 
The approximation techniques introduced have wide applicability to 
general (finite) multidimensional, state-dependent, birth-death proc- 
esses. These techniques are introduced by considering the “classical” 
telephony problems dealing with trunk group overflow traffic from 
the point of view of state-dependent, birth-death processes. The main 
part of the paper then applies these techniques to a two-dimensional 
trunk group retrial model of Wilkinson and Radnik. The method, 
which reduces the W-R model to an approximate, easily-solved, one- 
dimensional model, makes use of the transition probabilities for state- 
dependent, birth-death processes. These are obtained via a simple 
extension of known results. We use the one-dimensional results to 
compute blocking for a range of parameter values (trunk group sizes 
and retrial rates) exceeding the computational limits of the W-R 
model. Maximum relative errors do not exceed 10 to 15 percent, while 
for most cases of practical interest the relative errors are less than 5 
percent. The approximation also provides insight into the region of 
applicability of even simpler retrial models. This one-dimensional 
retrial model actually applies to more general (finite) state-depend- 
ent, birth-death processes (e.g., loss-delay systems). 


I. INTRODUCTION 


The purpose of this paper is to illustrate the usefulness of state- 
dependent birth and death processes in reducing the dimensions of 


957 


stochastic systems. The principal application is the reduction of a two- 
dimensional retrial model, proposed by R. I. Wilkinson and R. C. 
Radnik,' to an approximate one-dimensional model, which is then 
readily solved. While algorithms’ exist for the numerical solution of 
the two-dimensional Wilkinson-Radnik model, the large number of 
states that are often-:needed can result in convergence difficulties. 

Section II presents the history of and motivation for the techniques 
used throughout the paper. The equations for the reduced one-dimen- 
sional retrial model are described in Section III. Their solution is 
discussed in Section IV. Section V contains numerical results and 
comparisons, and Section VI discusses the theoretical accuracy of our 
one-dimensional approximation. 

The main points of this paper are the dimension reduction of 
stochastic models using a state-dependent birth-and-death process and 
a method of solution of the resulting approximate model. However, 
the retrial problem considered here as an example is of interest in itself 
and has been extensively studied in the past. L. Kosten’ and J. 
Riordan‘ considered retrials coming back in a secondary, uncorrelated 
Poisson stream. J. W. Cohen’ allowed for negative exponential distri- 
butions in the interarrival times of calls, the holding times of calls, the 
duration of the time interval between two successive attempts by a 
subscriber whose call was blocked at the first attempt, and the time 
during which a subscriber continues to make repeated attempts. All 
these works* attest to the difficulty of modeling and obtaining numer- 
ical solutions to the retrial problem. We hope that the ease with which 
one can obtain reasonable numerical results by using state-dependent, 
birth-death processes will motivate readers to consider this as one 
possible approach to the simplification of probabilistic systems. 


Wl. HISTORY AND MOTIVATION FOR THE USE OF STATE-DEPENDENT 
BIRTH RATES 


One early use of state-dependent birth rates to reduce a multidi- 
mensional system to a one-dimensional model is given in Ref. 6. The 
motivating problem was the analysis of an alternate-routed telephone 
network. The system to be analyzed consists of one or more primary 
groups of servers, each with its own arrival process. Arrivals which 
find all servers in its primary group busy are offered to a common 
overflow group (see Fig. 1). With the common assumption of Poisson 
traffic for the underlying arrival processes and exponential service 
times, one can readily write down the appropriate birth-death equa- 
tions. For the case where there is only one primary group, the analysis 


* Those noted are only meant to be representative of various works concerned with 
the retry phenomenon. They are not meant to provide a complete list of such works. 
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Fig. 1—Overflow problem. 


is already somewhat complicated, although it has been carried out.’ 
Moreover, reasonably sized trunk groups quickly lead to systems where 
even numerical solutions are not feasible. However, one is often only 
interested in the behavior of the overflow group, e.g., finding which 
attempts are blocked there and hence lost from the combined system. 
In this case, the primary trunk groups are of interest only inasmuch as 
they supply the input to the overflow group. It is here that state- 
dependent, birth-rate modeling has proved of value.* But before noting 
some of the previous work on state-dependent birth rates related to 
this overflow problem, it is useful to consider the application of our 
basic ideas on dimensionality reduction. 

For the simplest case of one primary group of N servers overflowing 
to a group of M servers, Poisson input rate A, and (unit) exponential 
holding time, the birth-death equations can be written as 


(A + L+7)Pi = APi-1,; + (J + 1) Pij41 +(i+ LD Paay 


L<N 
(A+ N + 7)Pn; = APn-1,j + GV + 1) PN. j+1 + APN j-1 (1) 
i=N j<M 


(N + M) Pym = APn-1.m + APN mM-1 
(P;=0 if i or j<0O), 


where P,; is the probability that there are i busy servers in the primary 
group and / busy servers in the secondary group. 
Since our main interest is in the marginal distribution P.;, we begin 


* Actually, if total lost calls were the only item of interest, then the equivalent 
random method (Ref. 8) would perhaps be more applicable. However, we will be 
considering more general questions here. 
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by summing eq. (1) over i. The result after simplification is 
JP; + APnj = (7 + 1) Pjti + APN j-1 J<M 
MPmu=)\Pnu-1 jJ=M, (2) 
(P;,Pvn,;=0 if j< 9), 


where we have denoted the marginal distribution P.; by P;. 

If we subtract the equation for j = M from that for 7 = M — 1, and 
then proceed to subtract the new equation for each / from the old one 
for 7 — 1 we obtain the equivalent, but simpler, system 


APnj = (J + 1) Py, 0=<=j7/< mM. (3) 


Note that we could have obtained eq. (3) directly by balancing the 
upward transitions from j to 7 + 1 with the downward transitions from 
jJ + 1 to7, in equilibrium. In any event, by now using the fact that Py; 
can be written as Py; P7* and denoting the term AP»; by \;, we obtain 


VWP= VU +) Pi 0<j<M, (4) 


an apparent one-dimensional birth-death process. Care must be taken 
in this interpretation. The quantity A; is the average “birth” rate when 
there are 7 busy on the overflow group. That is, the input process can 
be characterized by a state-dependent birth rate only in an average 
sense. Thus, while (4) is a valid equation satisfied by the equilibrium 
probabilities P,;, other quantities that might be obtained by viewing 
this as a birth-death process (e.g., transitory behavior) would at best 
be approximate. Indeed, even to obtain the P,’s from (4) it would be 
necessary to determine the A; exactly. Since this can usually only be 
done by solving the combined system (primary plus overflow trunk 
group), a problem we wish to avoid, we turn to approximations for the 
\,’S. 

Linear birth rates (e.g., A; = a + 67) were suggested in Ref. 6 for the 
case where the overflow group is infinite. This results in a negative 
binomial distribution for the state probabilities. Determining the pa- 
rameters a and b by matching the mean and variance of the number 
of busy servers was found to result in a reasonably good approximation 
for the state probabilities. The idea of relating the birth rates to 
conditional probabilities was presented in Ref. 7. The resulting ap- 
proximation for the state probabilities for an infinite overflow group 
were found to be better than those obtained with the negative binomial 
distribution, particularly for large values of the number busy. 

Extensions of the negative binomial approximation (linear birth | 


* The notation Py, refers to the conditional probability of N (busy on the primary 
group) given 7 (busy on the overflow group). 
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rates) to a system with a finite overflow group are given in Refs. 8, 9, 
and 10. The approach in Ref. 9 is to solve for the equilibrium proba- 
bilities P{”) for an infinite overflow group, and then simply terminate 
and normalize these to approximate the state probabilitiesP!” for the 
finite group, 1.e., 


PE 
Pp =~—_. (5) 


a 

i=0 
Equation (5) still implies a linear birth rate of the form A; = a + Ou. 
The problem is that if the parameters a, 6 are adjusted to match 
moments on an infinite trunk group, then for the finite case the total 
offered load A = Yo Ai:P{” will no longer match the true offered load. 
This problem can be rectified by adjusting the offered load (see, for 
example, Refs. 9 and 10); however, this generally results in the need 
for an iteration procedure. For example, a choice of A; results in a set 
of probabilities P{”’, and hence an actual offered load A = YX 
A;P, which, if not the desired value, results in a need to adjust the 
A;. This feedback effect, where the equilibrium probabilities must be 
used to adjust the offered load, is a dominant feature in this type of 
approach to dimensionality reduction. We will see shortly that this 
interaction between the reduced state probabilities and the assumed 
state-dependent offered load is even stronger for the retrial model 
considered. 

An important point to note is that, independent of the initial 
motivation for the above approximations, they can all be interpreted 
as attempting to approximate the conditional probability that the 
primary group is busy, given that there are 7 busy on the secondary 
group. This interpretation is important since it can often lead to insight 
into the applicability of the resulting approximation. 

Before proceeding we note that state-dependent birth rates have 
also been used to study multilink systems offered overflow traffic and 
to obtain approximations for the blocking seen by the various parcels 
of traffic offered to an overflow group as depicted in Fig. 1.''~"” 


lit. APPLICATION TO THE WILKINSON-RADNIK RETRIAL MODEL 


A diagram of the Wilkinson-Radnik (W-R) retrial model is given in 
Fig. 2. The underlying offered load is assumed to be Poisson with rate 
A. If these attempts find all c servers busy, they may defect from the 
system (probability D,) or they may wait a period of time and then 
retry (probability R,; = 1 — D,). Thus, the number of people / waiting 
to retry is increased by one with probability R,, whenever a first 
offered attempt arrives to find the number of busy servers i equal to 
c. When a customer retries, he either finds an idle server (t < c) and 
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INFINITE NUMBER OF 
WAITING POSITIONS 


Ry=(1-Dy) 





> — POISSON FIRST OFFERED LOAD 

c — NUMBER OF SERVERS 

D, — PROBABILITY OF DEFECTION FOR FIRST BLOCKED ATTEMPT 

Dz — PROBABILITY OF DEFECTION FOR SUBSEQUENT BLOCKED ATTEMPT 
r — RETRIAL RATE (EXPONENTIAL DISTRIBUTION) 


Fig. 2—Wilkinson-Radnik retrial model. 


hence is carried by the system, or, if 1 = c, he may defect with 
probability D. or again wait and retry with probability R. = 1 — Dr. 
With the assumption of exponential times to retry, this system is 
completely characterized by the state (7,7 ), 1 = number of busy servers 
(1 =0, ---, ce), 7 = number of customers waiting to retry (J = 0, ---, 
00). 

Denoting the mean time to retry by 1/r and the mean server holding 
time by 1/p, we can readily write the state equations for the probabil- 
ities P;; = P* (i busy server, 7 waiting to retry). Assuming for simplicity 
that R; = R. = R, we obtain: 


(A +77 + pi) Py = APiiuny +r YU + 1) Pi-sj+1 
+ p(t + 1)Pisi; i<c 
(AR + rj(1— R) + pe) Pejy = AP A315 + r(J + 1) Pe-ij+i (6) 
+r(j7+1)(1 — R)Pj41 + ARP. j-1 
(PaO: 5 ZOE SO): 
While the W-R model is a reasonable one for the customer retry 
phenomenon in telephone systems, eq. (6) quickly lead to numerical 


problems, even for modest values of c. Here, as with the overflow 
problem, we see that the dimensionality difficulty is caused by an 


* P denotes probability. 
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aspect of the vector state (1, 7), namely j, that we often are not 
concerned with, except as it influences 1, the number of busy servers. 
Hence, following the procedure outlined in Section II, we sum (6) with 
respect to 7 and obtain 


(A +rE(j|t) + pi) P; =APi-1 + rE (7|t — 1) Pi-1 
+p(itVPanr ie (7) 


and 
(uc) P. =AP.-1 + rE(y|e — 1) Pe-1, 


where 
P; = ¥\ Py 


F 


The term A; = rE(j|t) represents the mean input intensity associ- 
ated with the retries. If this quantity were known exactly, then (7) 
could be solved to yield the exact solution for the equilibrium state 
probabilities P;. The use of (7) to compute other quantities such as 
transitory probabilities would again be an approximation. However, 
one would expect such an approximation to be good if in the two- 
dimensional model, the value of 7 (number of retry sources) did not 
vary much from its mean for a given value of i (number of servers 
busy). This idea will be explored later. 

Before discussing how to obtain an approximation for the Ai which 
clearly depends on the unknown P(1), we note that direct balance of 
flows across the (i, 1 + 1)-state boundary as before yields the simpler 
(but equivalent to (7)) state equations 


AP; = w(t + 1) Piss, (8) 
where 
AN =A+A/=A+TrE(J| 2). 


It is clear that there is a strong interaction between the state 
probabilities P,; and the retry intensity A;. We now turn our attention 
to modeling this rather complicated relationship, and hence obtaining 
a solution to (8) which will hopefully approximate the two-dimensional 
W-R retry model adequately. 


IV. SOLUTION OF THE ONE-DIMENSIONAL RETRIAL MODEL 


The “solution” to a one-dimensional birth-death process is, of course, 
well known, provided the birth (and death) rates are given. Thus the 
main problem we are faced with is determining the 4A,’s for the one- 
dimensional model so that they capture the essence of the two-dimen- 
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t-7T t—7r+At t t+At 


| 


ARRIVAL IN (t— 7, t— r+At] THEN RETRIAL WITHIN rTO 

FINDS ALL C TRUNKS BUSY t+d7 TIME UNITS SO THAT 
THE RETRY OCCURS IN THE 
INTERVAL (t, t+At] 


Fig. 3—Necessary events for a retrial arrival in the interval (¢, ¢ + A ¢) conditioned on 
an inter-retrial time equal to r. 


sional model. As indicated earlier, A; is the sum of A (first offered traffic 
intensity) and A, (retrial intensity when z trunks are busy). We thus 
have, to first-order in Af, that the A;’s satisfy the relationship 


Aj At = P" {exactly one retrial in (t,¢ + At]| N(t) =i}, = (9) 


where N(¢) is the number of busy servers at time ¢ and the superscript 
(1) indicates that this probability is for the one-dimensional model. 

What we would like is for P“” in (9) to be close (in some sense) to 
P® the corresponding probability for the two-dimensional model. 
Using the law of total probability, conditioned on the inter-retrial time 
of the arrival under consideration, we have* 


P“ {exactly one retrial in (¢, ¢ + At]| N(¢) = 2} 


= | PUNE) = 6, alt= 1,0, r6, dr)| N(t) =i}, (10) 


T 


where a(t — 7, At) is the event that an arrival occurs in (t — 7, t — 7 

+ At] and r(r, dr) is the event that he will retry if blocked within (7, 

7 + dr| time units. Figure 3 represents these events pictorially. (Note 

that the assumptions made for the two-dimensional model imply that 

each blocked arrival can be tagged with a time to retrial, 7, taken 

independently from a distribution F’,.(r), at the time of his arrival.) 
Using P(A|B) = P(B|A)P(A)/P(B) in (10) we obtain 


P?{.) - | Pe =i|N(t—1) =c, a(t —7,At), r(r, dr)) 


- P{N(t—7) = c,a(t—7,At),r(7,dr)} /P{N(t) =1}. (11) 


Denoting the events {a(t — r, At), r(7, dr)} by {A,} and using the law 
of total probability conditioned on J(¢t — r) and J(t), the number 
waiting to retry at times ¢ — 7 and ¢, result in 


* For simplicity, we omit all terms of higher order than Af in eqs. (10) to (16). 
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P?(.} = y y 


7/79 y=) 
»P{N(t) =i d(t-—7) =f, S(t) =j2.| N(t-7) = 0, A,} 


» P{A,}/p{N(t) = 1}. (12) 
Using P(A, B|C) = P(A|B, C)P(B|C), this becomes 


P&py=] YY. PLN(t) =i,d(0) 


AO fom 
=j.|N(t—7)=¢,d(t-7) =f, Ar} 


»-P{J(t—1) =f, |N(t-—71) =c, A-} P{A,} (13) 
P{N(t) = 1} | 

Now the value of J(¢ — 7) represents the retrial intensity at (¢ — 
T). In general, for the two-dimensional model, P{J(t — +) = 7;} does 
indeed depend not only on the value of N(¢ — 7), but on the fact that 
an arrival has just occurred. However, this latter dependence is incon- 
sistent with the one-dimensional model. More specifically, we have 
assumed that the retrial intensity depends only on the state of the 
one-dimensional system, N(t — +). Thus we are led to making the 
approximation 


P{J(t—7) =ji| N(t—7) =c,A,} 
=P{J(t-—T)=y1,N(t—7T)=c}. 


Note that this approximation should tend to underestimate the retrial 
intensity A. when there are c busy servers and hence underestimate 
the blocking, particularly as seen by the retrials. 

Using this approximation, the formula P(A|B, C)P(B|C) = 
P(AB|C), and noting that 


P{N(t) =1, J(t) =y2| N(t—7) =¢, J(t-—7) =, Ar} 
= P{N(t) =1,J(t) =j2 -1| N(t—-7) =e, J (t-7) =}, 


we obtain 


P%.y= | YY P(N(t) =1, S(t) =f S(t-7) 


=ji|N(t—7) =c} 
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-P{N(t— 7) = c} P{A,}/P{N(t) = 0} 


= [ Pave =i|N(t—7T) =c} P{(N(t—7T) =c} 
-P{A,-}/P{N)t) = t}. (14) 

Thus we require that 

NAt = [pow = i|N(t—7) 


= ¢} P"{N(t— 71) = 0} P™ {AJ /PY(N(t) =i}, (15) 


where we have used the superscript (1) to emphasize that the proba- 
bilities are for the one-dimensional model. Using the Markovian prop- 
erties and independence assumptions for a birth-death process, (15) 
can be written as 


AiAt = | Poi(t) (AAt) P-RdF;, (tr) Pi, (16) 


where P.i(7) = P{ N(r) =i|N(o) = c}, i.e., the transition probabilities 
A. = birth rate when N = c 
R = Pr {a blocked attempt will retry} 
F,.(r) = distribution function for the time of retry 
P,.=P{N=c 
_> equilibrium probabilities. 
P;=P {N =f 


All are from the one-dimensional model. 
Thus, finally, we have that the overall state dependent birth rates 
are given by 


M=A+X, | (17) 
where from (16) 
Ne 
N — | P,,(t) dF, (1). (18) 


Up to this point, we have not made any assumptions regarding the 
retrial distribution F’.. Before doing so, it is worth pointing out that eq. 
(18) provides the correct answer for the two limiting values of the 
retrial rate r. As r goes to infinity, retrials occur “immediately.” If we 
represent this by setting F,. equal to a unit step at zero, then (18) 
reduces to 
_ PAR 
=> 





Xi bei, (19) 
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where 6,; is the Kronecker delta function. Combining eqs. (17) and (18) 
we find that A; = A for z less than c, and A. = A/(1 — R). Thus, when 
retrials occur immediately, they do not influence the state of the 
system. With one or more servers idle, the load remains at A; with all 
servers occupied, an arrival retries infinitely fast and (for R < 1) will 
exit from the system before a server becomes idle. 

In the other limiting case, as r goes to zero, customers “come back 
in an uncorrelated stream.” Proceeding as above, and noting that 


| P.i(t) dF, (r) = P; asr— 0 
0 


(if we let F. approach a unit jump function at infinity), we find that 
A; =A + P.A-R for all i, and thus A; = A/(1 — P.R). That is, the traffic 
intensity increases by a factor of (1 — P.R)7', independent of the 
number of servers occupied. This is a familiar retrial model. In Section 
V we see that there is a wide range of parameter values for which this 
simple model is not a useful approximation for the proportion of 
retrials blocked. 

We now return to eqs. (17) and (18), and use them to calculate 
recursive formulas for the A,’s. We assume, as in the two-dimensional 
retrial model, that the time to retrial is given by a negative exponential 
distribution with rate r, namely 


F,.(r) =1—e". (20) 
In this case, the equations become 
PNR 5 , 
N=At HS Poly) t=1,---, 0c, (21) 





where denotes the Laplace transform. In particular, 


v 





A. =. 
1—rRP..(r) (22) 
If we let 
A; —-—A 
eS 2 ree 23 
‘J Nj-1 =e J > Cc ( ) 
then we can write 
Aaa + Vo" pe Late (24) 
i] 


Substituting (17) into (19) and simplifying using A,;-;P)-1 = pjP;, we 
obtain 


eg a a Weegee are oo 20 
: Nj-1Py-1(") : ) 
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Using formula for P.;(r) developed in the appendix,* we obtain a 
recursion for 6;: 


65=1+r7r/A0o and 6; 
= (r + Aj-1 + py-1) 6j-1 — By-1 


Ot Gan Sn 
at ae for J] : oe (26) 
Combining (26) with (22), (24), and the formula for P...(r), 
P.c(r) ee. cen (27) 


(r+ pe) 8. — pe)?’ 


we get an iteration scheme for finding the A;i = 0, - - - , c which satisfies 
the birth-death equations and is consistent with the derivation of the 
Si =0,---,C¢. 

Before leaving this section, we note that the above derivation holds 
equally well if the underlying system (without retries) is characterized 
by an arbitrary (finite) state-dependent, birth-death process. For ex- 
ample, it applies to the loss delay system considered in Ref. 4, and to 
overflow traffic characterized via state-dependent birth rates (as dis- 
cussed in Section IT). Moreover, using the results given in the appendix, 
one can compute transition probabilities and other related quantities 
for this system (e.g., correlation function). 


V. NUMERICAL RESULTS 


We assess the accuracy of the one-dimensional approximation by 
comparing our results to those obtained from direct numerical solution 
of the two-dimensional W-R model. The particular comparisons pre- 
sented here were chosen to give the reader an understanding of the 
value of the one-dimensional approximation for a wide range of param- 
eter values. However, due to the difficulty of computing “correct” 
values (i.e., from the two-dimensional model), the results may not 
cover every possible region of interest. In particular, it is difficult to 
analyze the convergence behavior of the two-dimensional algorithm 
for large trunk groups or small retrial rates. This difficulty arises 
because the number of waiting positions must increase to obtain a 
good approximation to an infinite waiting room; in turn, the number 
of states increases markedly and roundoff errors may become signifi- 
cant. Fortunately (see Section VI), the two models give the same 
results as r tends to infinity or zero. The approximation is worst for 
values of r around 2 and gets progressively better as r gets larger or 
smaller. 


* The appendix shows that the P;;(¢) of a general one-dimensional birth-death process 
may be obtained in precisely the same way they were obtained for a combined delay 
and loss system in Ref. 4. This fact was recognized and used in Ref. 14 (Appendix B). 


568 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


We first look at retrial blocking, 1.e., the proportion of reattempts 
blocked. Figure 4 shows the retrial blocking for a system with two 
servers, as a function of offered load. We assume a probability of R = 
0.8 that a blocked customer will retry. As expected, the proportion of 
reattempts blocked increases as 7, the retrial rate, increases. For the 
two cases shown, r equal to 2.0 and 0.5, the relative difference between 
the two models is approximately 10 to 15 percent (as noted earlier, 
this is the worst case). We also see that the retrial blockings for the 
two retrial rates approach one another as offered load increases. 

For comparison, we show similar retrial blocking curves for systems 
with 5 and 30 servers, in Figs. 5 and 6, respectively. An interesting 
phenomenon can be seen here, but first observe that the offered loads 
in each of Figs. 4 through 6 correspond to values of call congestion 
ranging from 0.01 to 0.30, without considering retrials. For a given 
design load, without retrials, the percentage of reattempts blocked is 
much higher for a smaller number of servers. In particular, at 1-percent 
total blocking and r = 2.0, the retrial blocking is 51 percent for 2 
servers and 17 percent for 30 servers. This poor retrial performance of 
small server groups may be of importance in understanding customer 
satisfaction (or annoyance). We make one last point regarding retrial 
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Fig. 4—Retrial blocking, 2 servers, retrial probability = 0.8. 
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Fig. 5—Retrial blocking, 5 servers, retrial probability = 0.8. 


blocking by referring the reader back to Fig. 6. The curve for r = 0 
corresponds to the simply-computed “uncorrelated retrial’ model. We 
see that, for a high-design blocking, both the one- and two-dimensional 
models are well approximated by this simpler model. 

Figure 7 illustrates the dependence of retrial blocking and call 
congestion on trunk group size. The proportion of retrials blocked 
generally decreases as the group size N increases and seems to ap- 
proach an asymptote. The total call congestion also decreases for small 
to medium size groups, but then increases for N larger than 30 and 
approaches the same asymptote. Notice that, if we restricted our 
attention to the region where the two-dimensional model applies, we 
would not get a good view of the limiting behavior. The reason for the 
seemingly anomalous behavior of total call congestion is that the larger 
and exceedingly efficient trunk groups are correspondingly more sen- 
sitive to traffic above the design load. We now briefly discuss the 
limiting behavior. 

If we assume that retrials return in an uncorrelated Poisson stream 
(r = 0 case), then the inflated load A is given by the solution (deter- 
mined by iteration, e.g.,) to 
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Fig. 6 —Retrial blocking, 30 servers, retrial probability = 0.8. 
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~ 1 — B(N, A)-R’ ) 
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where B(N, A) is the Erlang-B blocking function. The resulting 
blocking (for both retrials and first attempts) 1s the lowest curve in 
Fig. 7. For large values of N, we use the well-known approximation 1 
— N/A for B(N, A) (see Ref. 15) in conjunction with (28) to obtain the 
asymptote, 


B 


1—R(1-B)’ a 


call congestion = 


for a fixed design blocking B. In Fig. 7, the asymptote 0.048 is shown 
by a dashed line. 

Another way of obtaining (29) is to assume that, for large trunk 
groups, \ — N equals the traffic lost on first attempts, (A — N) R equals 
the traffic lost on second attempts, --+, (A — N) R”"' equals the traffic 
lost on the kth attempts, etc. Then the total blocking is given by 
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Fig. 7—Call congestion with retrials for trunk groups engineered at B.01. 
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We close this section by pointing out that the one- and two-dimen- 
sional models yield practically the same values for the call congestion 
and the time congestion. A few sample values are shown in Table I. 


VI. THEORETICAL ACCURACY—A NUMERICAL LOOK 


As indicated in the derivation in Section III, the one-dimensional 
state equations may be obtained by summing the two-dimensional 
state equations over the number of waiting positions occupied. When 
this is done, the arrival rate of retries (when i servers are busy) equals 
the retrial rate times the expected number of waiting positions occu- 
pied (given z servers busy). In equation form, 


(= r-E(j|i). (30) 
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If one knew the conditional expectations E (j| i) precisely, then all the 
A,;’s, and hence all the P,’s of the one-dimensional equations, would be 
known exactly. The approximation occurs in the iteration procedure 
for finding the A,;’s where we implicitly assume that the one-dimen- 
sional equations actually come from a one-dimensional Markov model 
that adequately describes the retrial situation. 

Intuitively, if either (i) the standard deviation-to-mean ratio o;,;/ 
E(j|t) is very small, or if (uz) EV |t) + ko;|; is “equivalent” to EF (| z) 
(for some reasonable k), then for practical purposes we have a proper 
one-dimensional system; since knowing the number of busy servers 1 
implies that we know, {namely, FE (j|z)}. Hence, if either of conditions 
(zt) or (it) above hold, then we expect the one- and two-dimensional 
models to yield similar results. We look at a numerical example to 
illustrate the point. Table II shows the effect of varying the retrial 
rate, for five servers offered 2.22 Erlangs, while keeping all other 
parameters fixed. As the retrial rate goes to zero, the ratio o,;|5/E (j| 5) 


Table I—Time and call congestion (r = 2.0, R = 0.8) 


One-Dimensional Two-Dimensional 
Model Model 

Offered No. of Time Call Time Call 
Load Servers Cong Cong Cong Cong 
0.381 2 0.055 0.089 0.055 0.092 
2.22 5 0.062 0.084 0.063 0.088 
10.6 15 0.072 0.085 0.073 0.087 
24.8 30 0.081 0.092 0.084 0.100 
0.595 2 0.116 0.179 0.116 0.186 
2.88 5 0.136 0.176 0.136 0.188 
12.5 15 0.169 0.195 0.167 0.204 
28.1 30 0.194 0.212 0.197 0.227 


Table ll—Effect of retrial rate (Cc = 5 servers, offered load = 2.22 


Erlangs) 
Retrial 
Rate E(7|5) N';/r 0;|5 o;|5/E(j|5) 
128.0 0.058 0.058 (0%) * 0.24 4.14 
64.0 0.102 0.101 (1%) 0.32 3.14 
32.0 0.166 0.161 (3%) 0.42 2.53 
16.0 0.250 0.234 (6%) 0.52 2.08 
8.0 0.349 0.316 (9%) 0.64 1.83 
4.0 0.460 0.403 (12%) 0.76 1.65 
2.0 0.577 0.499 (14%) 0.88 1.53 
1.0 0.702 0.609 (13%) 1.00 1.42 
0.5 0.856 0.758 (11%) 1.14 1.33 
0.25 1.095 0.997 (9%) 1.31 1.20 
0.125 1.527 1.440 (6%) 1.55 1.02 
0.0625 2.398 2.300 (4%) 1.99 0.83 
0.03125 4,107 4.010 (2%) 2.62 0.64 
0.015625 7.515 7.422 (1%) 3.55 0.47 


* The number in parentheses is the relative difference between E( 19) and our one- 
dimensional approximation to it, A’s/r. 
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tends to zero [as do 0;;:/E(j|t), t = 0, 1, +--+ , 4—not shown in Table 
II]. At the same time, our approximation A; /rconverges to E (/| 5). 

At the other extreme, as the retrial rate tends to infinity, Table II 
shows again that A;/r converges to E(j|5). However, the standard 
deviation-to-mean ratio tends to infinity, proving that this is not a 
sufficient condition for convergence. On the other hand, FE (/| 5) + ko;\5 
~ E'(7|5), for any k. Intuitively, if we ignore the state i = c (= 5), we 
again have a one-dimensional Markovian system. Indeed, as r increases 
retrials occur instantaneously and the probability of having anyone in 
the waiting room tends to zero, if any server is free. 

In summary, \../r converges to E'(j|c) as r tends to zero or infinity. 
In these cases, the one-dimensional model gives the exact same answer 
as the W-R model but is much easier to compute (especially for r small 
since in this case a very large waiting room is needed). Unfortunately, 
the approximation seems worst for values of r around 2.0. Nevertheless, 
Table II shows only a 14-percent relative error in the approximation 
for E(7| 5) in this case. 
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APPENDIX 
Transition Probabilities of a One-Dimensional Birth-Death Process 

We show that one may obtain a solution for the transition probabil- 
ities for any finite-state, one-dimensional, birth-death process. The 
solution is not original; it precisely follows the derivation of the 
transition probabilities for the simplest combined delay and loss system 
given by J. Riordan (Ref. 4, pp. 96-98). 

Assume we have arbitrary state-dependent birth and death rates, 
A; and p;, respectively. Further assume that A; = 0 for 1 = c + 1 and 
define pp, = A-1 = pe+1 = 0. Then one obtains the usual system of 
ordinary differential equations 


Pit) = 4-1 Py-1(t) — (Ay + py) Pid) + py Pi+i(b) 
k,j = 0, 1,-+-,¢. (31) 


Since P;(0) = 0 for i + 7 and P,,(0) = 1, the Laplace transform of 
P;;(t) is given by sP;(s) for i # 7 and by —1 + sP;,;(s) for i = 7. Hence, 
the Laplace transform of eq. (31) is 


Ay-1Py-1(s) —(s+aAj+ 1) Pis(s) + Hj+iPij+1(s) = 0; 
i,7=0,1,---,¢c. (32) 
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For any 1 = 0, 1, --- , c we can write 
DIU; = 6, (33) 


where 46; is a c + 1 dimensional vector whose (i + 1)st component is 1, 
with all other components equal to zero, 


Pi. 
IT; = | Py |, 
Pye 


and 
Sto py 0 oo 0 0 
—No St+ Ai + p1 —pL2 0 0 
0 —z2 st+h2+ [tz —p3 0 
D= : 
0 
: 0 —)\c-2 s+ Ac-1 = pc—1 —Pe 
0 0 0 —ro-1 SH He 
If we define D;, A; via 
Do =— 1 
D, =s+ Ao 
Dizi = (8 + Ai + pi)Di — Ai-1piDi-1 Le 2) ee ct 
and 
Ao = 1 
A, =s+ pe 


Ain = (8S + Aci + Pe—1) Aj = Nei pbe-i+ 1 Ai-1, 


then the transforms, P,(s), of the transition probabilities are given by 


; AiAi41 eee Aj-1 Dj Ac-; L<J 
Det(D) Pi(s) = ) Mi+ipyj+2 °°° iD Ac-i L>J, (34) 
D;A--; l =] 


where the determinant of D can be written as 
Det(D) = DAs + Le) 7 Nc—iplneDe-1. 


Equation (34) is the key formula used in Section IV. 
We can go one step further and write down a “closed-form” solution 
for the P,,(t), namely, 


Dr) Dr) _,, 
P,(t) = SSMsin” 


where 


STOCHASTIC SERVICE SYSTEMS 575 


Op = Agee Nie 
M; = Bo eee bi 
« Di(r) 
S.(r) = 
() = 2 OM, 


and the sum its over the c roots of det(D) = 0. Of course, the use of this 
solution is limited by one’s ability to find characteristic roots. 
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Variable Rate Coding of Speech 


By J. J. DUBNOWSKI and R. E. CROCHIERE 
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In this paper, we examine a number of concepts and issues con- 
cerning variable-rate coding of speech. We formulate the problem as 
a multistate coder (t.e., a coder that can operate at several bit rates) 
coupled with a time buffer. We first analyze the theoretical aspects of 
the problem by examining it in the context of a block processing 
formulation. We then suggest practical methods for implementing a 
variable rate coder based on a dynamic buffering approach. We also 
allude to a multiple user configuration of variable-rate coding for 
TASI-type applications. A practical example of a variable rate ADPCM 
coder is presented and applied to speech coding. It is shown that by 
careful design the algorithm can be made to be as robust to channel 
errors as that of a fixed rate ADPCM coder. 


l. INTRODUCTION 


In the design of digital speech coders it is often assumed that the 
coder and channel operate at fixed bit rates. In reality, however, it is 
known that speech is an intermittent and nonstationary process, and 
that in many applications the user demand on a communication system 
is a variable process. In practice, these intermittent properties can be 
utilized to make the design of a communication system more efficient. 
For example, the first property, that of an intermittent source, is 
utilized in communication systems such as TASI (Time Assignment 
Speech Interpolation).'’ The second property, that of a variable 
demand on the system, is being explored by various authors for use in 
packet transmission systems*” and results in a variable rate channel 
from the point of view of the user. 

In both of the above systems, an important element of the system is 
a variable-rate coder. In its simplest form, it may amount to a trivial 
transmit/no transmit decision as was used in the initial TASI systems. 
More generally, we might characterize a variable-rate coder according 
to a configuration shown in Fig. 1 where both the source activity and 
the channel rate are assumed to be variable. 
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In this paper, we examine a number of concepts of variable rate 
coding. We formulate the problem as a multi-state coder (i.e., a coder 
with several transmission states) coupled with a buffer to take up the 
“slack” between the desired source rate and the channel rate. In 
Section II we investigate theoretical aspects of variable rate coding 
using block processing concepts and rate distortion theory. Section III 
covers practical aspects of implementing variable-rate coders and in 
Section IV we present an example of a variable-rate ADPCM coder. 


il. A BLOCK PROCESSING ANALYSIS OF VARIABLE RATE CODING 
2.1 Theoretical consideration 


To examine the theoretical performance of a variable-rate versus a 
fixed-rate coder, we can consider the problem in terms of a block 
processing problem. Figure 2a illustrates an example of a block of N 
samples of a zero mean nonstationary signal s() as a function of time 
n. For convenience, we assume that this signal is uncorrelated from 
sample to sample (as in the difference signal of a ppcm coder). 

Figure 2b illustrates the “short-time” variance or power of this 
signal, denoted as o7(n). The noise power introduced by the coder 1s 
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Fig. 2—Illustration of (a) a speech waveform and (b) its variance and quantization 
distortion after coding. 
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denoted as d*(n) and is also illustrated in Fig. 2b. As a performance 
criterion, we assume that the signal power to noise power ratio over 
the block, defined as 


N-1 
¥ o* (n) 


n=0 


es (1) 
y d*(n) 


s/n = 10 log 


is a sufficient measure for comparison. We discuss the practical merits 
of this measure later. 

For a fixed-rate coder, the same number of bits/sample, R,, is used 
for quantizing each sample s(n). Therefore, the number of bits, B, 
used to encode the total block is | 


B=R,N. (2) 


Also, from rate-distortion theory,”’ it is known that an approximate 
relationship between the bit rate and distortion of a quantizer is 


7 o*(n) 
Rrp=@+-— 5 Yore( Sr ) (3) 


where o7(n) is the variance of the signal as a function of time n, d’(n) 
is the variance of the quantization noise, and @ is a constant which is 
dependent on the characteristic of the quantizer and on the probability 
distribution of the signal. By rearranging (3), the distortion of the 
quantizer as a function of time can be shown to be 


d?(n) = o7(n)2?4 *, (4) 


By averaging d*(n) over N samples and applying the results to (1) and 
(2), the s/n of the fixed rate coder can then be shown to have the form 


s/n fixed = 20 (R; — Alogio2 (5a) 


B 
= 20( = A)owe2 (5b) 


This s/n does not include the additional prediction gain that can be 
obtained if the input to the coder is correlated. For our purposes in 
this section, we assume that all correlations in the signal have been 
removed prior to quantization and that this s/n represents only the 
signal-to-noise ratio of the residual (uncorrelated) signal. 

For the variable-rate coder, the number of bits/sample used to 
encode the nth sample is denoted as R(n) (where it is assumed that 
R(n) does not have to be an integer). The choice of R(n) for n = 0, 1,- 
--,NV -— 1 is then made such that the signal-to-noise ratio in (1) is 


VARIABLE RATE CODING OF SPEECH 579 


maximized and the total number of bits used to encode the block is B, 


1.€., 
N-1 


B= )) R(n). (6) 


The solution to this maximization problem is well known’® and results 
in the condition that the distortion power d*(n) at each sample must 
be identical, i.e., 

d?(1) = d?(2) =... d*(N— 1) = d?. (7) 


Therefore, to maximize the block s/n the noise generated by the 
variable-rate coder must be flat across time. The number of bits/ 
sample which must be used by the coder as a function of time is then 


1 o°(n) 
By applying (8) to (6), a relationship between the total number of 
bits in the block, B, and the distortion d? can be expressed in the form 


N-1 
B= ¥ Rin) 
n=0 
1 N-1 
= N6 + — loge [J o*(n) 
2 n=0 
N 
a loge d aft (9) 
2 
Rearranging terms and solving for di gives 
N-1 1/N 
di; = aaa | Il on) (10) 
n=0 
The signal-to-noise ratio for the variable-rate coder can now be deter- 
mined as N-1 
y o°(n) 
n=0 
and substituting in d? from (10) gives the revealing form 
B 
s/n | a 20( F - ” logio2 
1 N-1 
— ¥ o*(n) 
+ 10 logio a ‘ (12) 
(om) 
n=0 
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In comparing the s/n of the variable rate coder (12) to that of the 
fixed rate coder (5b), it is seen that the first term in (12) is identical to 
that of the fixed-rate coder. The second term therefore represents the 
improvement in block s/n that can be expected by using a variable- 
rate coder instead of a fixed-rate coder. As seen by the form of this 
term, this improvement is signal-dependent and is in fact equal to the 
ratio of the arithmetic to geometric means of the signal variance o7(n) 
over the block. If o*(m) varies widely over the block, i.e., if the signal is 
highly nonstationary, then this gain can be large. If o’(n) is relatively 
constant over the block, i1.e., the signal is approximately stationary, 
then the arithmetic and geometric means are essentially equal, and no 
improvement can be expected. 

It is interesting to note that this result is similar in form to that in 
transform coding.” In transform coding, the variation of the signal 
variance across the block corresponds to a variation in the frequency 
domain and occurs due to correlations in the input signal (i.e., a 
nonflatness of the signal spectrum). In the variable-rate coding appli- 
cation, we assumed that these correlations have already been removed 
and that the variation of the signal variance across the block occurs 
due to the nonstationarity of the signal in time. By a careful refor- 
mulation, however, both the effects of correlated inputs (i.e. prediction 
gain) and nonstationarity can be incorporated into the above relations 
for the variable rate coder. 

The reader should also be cautioned that while block s/n 1s appealing 
mathematically it may not be the most appropriate criterion in terms 
of perception.”"” In practice, some modification of the block s/n 
criterion may be required. Since little is presently understood about 
the perceptual effects of the distribution of s/n in time, this is a subject 
that requires further study before a more perceptually meaningful 
criterion can be proposed. Further comments on this subject are 
presented in Section V. 


2.2 Potential improvements of variable rate over fixed rate coding 
2.2.1 Single speaker 


To obtain estimates of the theoretical improvement in block s/n for 
a variable rate coder, we have measured the arithmetic-to-geometric 
mean ratio of the signal variance o*(n) over blocks for speech data for 
a (one-sided) telephone conversation. The signal variance o°*(n) was 
obtained by running a 4-bit ADPcM coder'' on the sentence and using 
the step-size of the coder as a (scaled) estimate of o(n) of the differ- 
ential input to the quantizer. Figure 3a shows an example of the speech 
waveform and Fig. 3b shows the corresponding scaled estimate of o(n) 
for the differential input to the quantizer in this coder. 

The variance estimate o*(n) of the coder was partitioned into blocks 
of size N and the ratio 
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Fig. 3—Example of (a) a speech signal and (b) the estimate of o(n) of its first-order 
predicted difference signal (8-kKHz sampling rate). 
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was computed for each block to obtain an estimate of the potential 
s/n gain for variable rate coding. 

__ The solid line in Fig. 4 shows a plot of the average G, denoted as 
G, for this sentence as a function of the block size N in milliseconds. 
As seen in Fig. 4, significant gains in s/n cannot be expected with 
variable-rate coding of a single speech source until block sizes greater 
than about 100 ms are used. That is, the size of the block must be 
greater than the typical duration of phonemes and micro-silence in 
speech before improvements in s/n can be realized. 

In real-time communications systems, blocks of this size may not be 
acceptable because they imply large transmission delays. Other poten- 
tial applications exist, however, in voice-storage and message “store- 
and-forward” systems where delays may not be of concern. An alter- 
nate advantage that is offered with variable-rate coding is that it allows 
greater flexibility in gracefully varying the transmission rate of the 
coder rather than restricting it to rates which are a multiple of the 
sampling rate. Block sizes can be relatively small to achieve this 
purpose. 


G = 10 logio (dB) (13) 


2.2.2 Multiple speakers (TASI) 


When several sources share a channel, possibilities exist for greater 
improvements in overall performance due to TASI advantages. One 
possible approach to encoding P sources into a single channel, in a 
block fashion, is to assign each source a sub-block of size N/P. By 
concatenating the P sub-blocks into a single large block of size N, the 
problem can again be treated as a single source problem. 
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Figure 5 illustrates such an example where the variances of P 
concatenated sources oj(n), ---, op(n) are plotted as a function of 
time n. If the sources have greatly different variances, as depicted in 
Fig. 5, then the effective concatenated signal over the block will appear 
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Fig. 4—Arithmetic-to-geometric mean ratio of the signal power (expressed in decibels) 
of a sentence as a function of the block size. 
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Fig. 5—Block formulation of a multi-user variable rate coder. 
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to be highly nonstationary and will therefore have a large arithmetic- 
to-geometric mean ratio. This suggests that a large s/n gain over the 
block can be obtained using variable-rate coding instead of a fixed bit/ 
sample assignment. In effect, sources with larger variances will receive 
more bits and sources with lower variances will receive fewer bits. 
Each source will effectively receive the same amount of noise power as 
shown by eq. (7). Whether this is the most appropriate choice from a 
subjective point of view is again a question which remains unanswered 
at this time. _ 

The dashed lines in Fig. 4 indicate measured values of G for 2, 4, 8, 
and 12 shared users. The gains along the left vertical axis are strictly 
due to TASI gains alone. 


lil. PRACTICAL CONSIDERATIONS IN IMPLEMENTING VARIABLE RATE 
CODERS 


3.1 A block processing approach 


- In Section II, we assumed for purposes of analysis that the variable 
rate coder is implemented in a block processing manner with a fixed 
total number of bits B allowed in each block. Practical bit allocation 
schemes for this type of implementation have been investigated for 
use in transform coding’”® and can be carried over to the variable-rate 
coding application as well. Since this can be done in a relatively 
straightforward manner, we will not go into detail on this approach. 


3.2 Dynamic buffer approach 


An alternative approach to variable rate coding can be realized using 
a dynamic buffering strategy. A similar approach has been investigated 
by Tescher and Cox for use in image coding.’” The method is illustrated 
in Fig. 6 for a single source example. The coder receives speech samples 
s(n) at a fixed sampling rate and encodes them with a variable number 
of bits/sample. The output bits of the coder are then stored serially in 
a dynamic first-in, first-out buffer and the channel receives output bits 
from the buffer at the channel rate. A buffer control monitors the state 
of the buffer and a variance estimate of the input signal and regulates 
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Fig. 6—Block diagram of a variable-rate coder based on buffering the output bit 
stream. 
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the number of bits/sample used by the coder. At the receiver, a similar 
variable rate decoding process takes place. 

When the activity in the source is high, the buffer control increases 
the number of bits/sample used by the coder and decoder respectively 
above the channel rate. The transmitter buffer begins to fill up, and 
the receiver buffer begins to drain out. When the source activity is 
low, the coder and decoder use less than the average number of bits/ 
sample and the reverse process takes place. The total signal delay 
across the coder/decoder is fixed at a value equivalent to the buffer 
size, B bits, divided by the channel rate (bits/second). 

An alternative dynamic buffer strategy, based on buffering the data 
samples, is shown in Fig. 7. In this case, the buffer supplies samples to 
the coder at a variable rate. The buffer control adjusts this rate as a 
function of buffer status and signal variance while matching the output 
rate of the coder to that of the channel rate (bits/sample). When the 
source activity is high, the actual sampling rate transmitted through 
the channel lags the source rate. This causes the transmit buffer to fill 
and the receive buffer to empty, as in the scheme of Fig. 5. Conversely, 
when the source activity is low, the channel transmits samples at a 
rate greater than the input source rate. This results in filling the 
receiver buffer while depleting the transmitter buffer. The total signal 
delay for the system is equal to the buffer size N (samples). 


3.3 Buffer control 


In both the above dynamic buffering approaches, a key element in 
the algorithm is the buffer control. In this section, we propose a 
technique for implementing this control in a recursive manner which 
applies to either of the above methods. 

The algorithm 1s based on the rate distortion relation (8), which can 
be expressed in the form 


a°(n) 
d3(n)’ 


where d;(n) denotes the (scaled) distortion level in the quantizer and 
includes the factor @ in (8). Therefore, 


Rin) = loge (14) 






BUFFER 


BUFFER 


Fig. 7—Block diagram of a variable-rate coder based on buffering the input samples. 
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d3(n) = d?/2°, (15) 


where d2(n) will be allowed to vary “slowly” with time in a manner 
which will be described shortly. 

In practice, (14) must be modified to account for overflow and 
underflow of the buffers. Let B denote the size of the buffer and b(n) 
denote the number of bits stored in the transmitter buffer at time n. 
Furthermore, let R.(n) denote the actual number of bits/sample used 
by the coder at time n and Ra(n) be the number of bits removed from 
the buffer during the sample period at time n for transmission over 
the channel. If the transmitter buffer is full, then the coder cannot be 
permitted to use more than Ru(n) bits/sample and if the buffer is 
empty the coder cannot be permitted to use less than Ra(n) bits/ 
sample. Therefore, the following algorithm applies: 


[R(n)] s Ra(n) if b(n) =B 
R.(n) = 4[R(n)] otherwise (16) 
[R(n)] = Ra(n) if b(n) = 0, 


where [R(n)] implies the operation of rounding R(n) in (14) to the 
nearest integer, < implies reducing R.(n) to be less than or equal to 
R.(n), and = implies increasing R.(n) to be greater or equal to Ra(n). 

While the constraints in (16) prevent the buffers from overflowing 
or underflowing, they are not sufficient to assure that the buffers will 
be effectively utilized. If the average R(n) is too large or too small, the 
buffers will remain in a state of being near full or near empty, 
respectively. To efficiently utilize the buffers, the average rate of R(n) 
should be close to that of the channel rate. This is similar to eq. (6) in 
the block processing approach, which states that the average bit rate 
over the block is equal to the channel rate. This condition must be 
realized by adjusting the distortion level d3(n). If the transmitter 
buffer is excessively full for long periods of time, then it can be seen 
that d2(n) is too small and should be increased. Alternatively, if the 
transmitter buffer is empty for long periods of time, then d2(n) is too 
large and should be reduced. 

The algorithm that we have investigated for controlling di(n) is 
based on the recursive relation 


d;(n) = ds(n — 1)-H(b(n — 1), (17) 


where d2(n) is the distortion level at time 7, d2(n — 1) is the distortion 
level at time n — 1, and H(b(n — 1)) is a multiplication factor which is 
dependent on the number of bits 6(n — 1) in the transmitter buffer at 
time n — 1. Figure 8 illustrates an example of H(b(n — 1)) as a function 
of b(n — 1). The exact shape of H(b(n — 1)) is not overly critical, 
except that it should be monotonically increasing and be less than 1 
for b(n — 1) near zero and greater than 1 for b(n — 1) near B. The 
intercept where H(b(n — 1)) = 1 determines the average buffer level 
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Fig. 8—Multiplier value H(b(n — 1)) as a function of the buffer status b(n — 1). 


about which the b(n) fluctuates. If H(6(n — 1)) 1s close to 1 for all b(n 
— 1), the algorithm will adapt slowly and if it becomes greatly different 
than 1, the algorithm will adapt rapidly. Typically, the time constant 
for adaption should be on the order of the total buffer delay, B. In 
practice, a piecewise approximation to H(b(n — 1)) is probably suffi- 
cient. Also, it is desirable in practice to set maximum and minimum 
levels for d2(n), i.e., 

din = d2(n) S hee (18) 


This algorithm for controlling d3(n) is similar in many respects to the 
one-word memory algorithm proposed by Jayant, Flanagan, and Cum- 
miskey’’ for adapting the step-size of an ADPCM coder. 

A choice exists in generating the buffer control algorithm at both 
the receiver and transmitter, or at the transmitter alone. If the latter 
choice is made, additional information must be transmitted along with 
the serial data to indicate code word size. In either case, recovery from 
channel errors is essential. One example for accomplishing this recov- 
ery is discussed in the next section. 


IV. AN EXAMPLE OF A VARIABLE-RATE ADPCM CODER 
4.1 Basic design 


To investigate the properties of a variable-rate coder, we have 
implemented a modified version of the algorithm in Fig. 7 by computer 
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simulation. A block diagram of this implementation is shown in Fig. 9. 
The variable rate coder was designed around an ADPCM (adaptive 
differential pcm) coder'’ that can operate at 2, 3, 4, or 5 bits/sample. 

The output of the ADPcM coder is framed into packets of typically 
.60 bits with a 2-bit header preceding each packet. The buffer control 
updates the number of bits/sample, R.(m), used by the ADPCM coder 
once per packet and transmits this decision to the receiver by means 
of the 2-bit header. It receives information on the buffer status b(n) 
from the input buffer and an estimate of the signal variance o*(n) from 
the ADPCM coder. 

Each packet is encoded with either 2, 3, 4, or 5 bits/sample corre- 
sponding to 30, 20, 15, or 12 samples of data per packet respectively. A 
packet length of 60 bits (plus 2 header bits) is chosen because it is the 
smallest common multiple of 2, 3, 4, and 5 and results in a fixed packet 
size independent of the number of bits/sample used by the ADPCM 
coder. 

Because the buffer control transmits the number of bits/sample, 
R.(n), used for encoding each packet the receiver algorithm is simpli- 
fied and does not require a buffer control computation. This, coupled 
with the fixed packet size, allows for an overall variable rate coder that 
is more robust in recovering from channel errors. If a buffer control 
computation were to be used in the receiver, its variance information 
o7(n) would have to be obtained from the ApPcM decoder and it would 
be highly susceptible to channel errors in the data. Once synchroni- 
zation between the transmitter and receiver is lost it may not be able 
to recover because the receiver may be using incorrect bits for decod- 
ing. With the algorithm proposed here, this cannot happen. Synchro- 
nization is unaffected by errors in the data stream. If an error occurs 
in the header bits, a misalignment of the transmitter and receiver 
buffers can occur. This type of error is not disastrous, however, and is 
recoverable with this algorithm as will be seen later. 
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Fig. 9—Block diagram of the variable rate ADPCM coder. 
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4.2 The ADPCM coder 


Figure 10 is a block diagram of the ADPpcM coder/decoder. The 
signal, e(n), resulting from the difference of the input x(n) and its 
predicted value y(n), is quantized using an adaptive step-size quantizer. 
The predicted signal, y(7), is obtained from a first-order predictor, as 
seen in the figure. In the receiver, the difference signal é’(n) is decoded 
from an adaptive step-size decoder and the first-order predictor loop 
is used to generate the output signal x’(n). 

The step-size logic adapts the quantizer step-size to track the rms 
level o(n) of the error signal e(7) and is based on the one-word memory 
algorithm proposed by Jayant, Flanagan, and Cummisky.'’ Letting 
A(n) represent the step-size at time n and A(n — 1) represent the step- 
size at time n — 1, this algorithm is described by the relation 


A(n) = A(n — 1)-M(| e(n — 1)]). (19) 


M(|c(n — 1)]) is a multiplication factor that depends on the magnitude 
of the code word c(n — 1) at time n — 1. If upper quantizer levels are 
used, a value of M greater than one is used and if lower quantizer levels 
are used, a value of M less than one is used. The M values that were 
used are close to the values proposed by Jayant." 

In the operation of the variable rate coder, the number of bits/ 
sample used by the apPcMm coder changes and the step-size must be 
adjusted accordingly. This adjustment is made in such a way that the 
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Fig. 10—Block diagram of the ADPCM coder. 
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center of the quantizer characteristic for the new bit rate is matched 
to the center for the previous bit rate. This alignment is illustrated in 
Fig. 11 for the 2-, 3-, 4-, and 5-bit quantizer characteristics. The 
horizontal scale denotes the (appropriately normalized) input signal 
e(n) to the quantizer and the vertical scale denotes the (appropriately 
normalized) output signal é(m) from the quantizer (plotted only for 
positive values of e(n) and é(n)). The step-sizes Az to As denote the 
relative step-sizes for the 2- to 5-bit/sample quantizer characteristics, 
respectively. By adjusting the step-size in this way, the loading factor 
and the dynamic range of the quantizer remains approximately the 
same when the number of bits/sample is changed—only the resolution 
changes. 


4.3 Variance estimation 


The buffer control requires an estimate of the variance of the signal 
e(n) to compute the bit allocation R,(n). This estimate is obtained 
directly from the step-size adaptation algorithm in the ADPCM coder. 
It can be observed that, for a given loading factor and a given number 
of bits/sample, the square root of the signal variance, o(n), of the 
signal e(7) is proportional to the step-size. 
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0 1 2 
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Fig. 11—Quantizer characteristic for 2- to 5-bit/sample characteristics. 
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The M values that were used here resulted in approximately a +20 
loading factor for each of the quantizer characteristics which is close 
to the optimum loading (in the mean-square error sense) proposed by 
Max.” This results in a quantizer characteristic centered about the 
variance of the signal as illustrated in the scale above Fig. 11. The 
estimate o(n) is therefore identified as the center of the quantizer 
(magnitude) characteristic which varies adaptively with the step-size 
adaptation. 


4.4 Bit rate assignment 


The buffer control determines the bits/sample assignment of the 
ADPCM coder, and it is based on the rate distortion relation in eq. (14). 
To give added flexibility to the algorithm, we also allowed a scale 
factor L in this equation to regulate the sensitivity of the bit allocation 
decision. This relation has the form 

2 
o“(n) 
R(n) =L1 —— }, 20 
where L is a parameter that can be adjusted. 


4.4.1 Open loop control 


To obtain an understanding of the range of values that L and d3(n) 
can take, we first ran the variable rate coder with an unlimited size 
input buffer and an open loop control of the bit assignment (i.e., d3(n) 
was fixed). The parameters L and d2(n) were adjusted as control 
parameters and the bit allocation was chosen on the basis of eq. (20) 
rounded to the nearest value 2, 3, 4, 5, or 6. The average bit rate used 
to encode a single sentence was then measured as a function of L and 
d?(n). 

Figure 12 shows a plot of the average bits/sample, R(n), used by 
the coder, for this sentence, as a function of L and d 2(n). As seen in 
the plot, as L increases, R(n) becomes more sensitive to variations in 
d?(n). Also, the range of d2(n) over which the average coder bit rate is 
between 2 and 6 bits/sample is clearly observed in this figure. These 
. results were found to be useful in establishing practical limits for d3(n) 
when the adaptive buffer control is used. 


4.4.2 Closed-loop dynamic buffer control 


In the closed-loop buffer control, a limited size buffer was used, and 
a bit/sample allocation was made once per 60-bit packet, as described 
earlier. The bit allocation R.(nm) was made on the basis of the scaled- 
rate distortion relation of eq. (20), and the allowed distortion d3(n) 
was “slowly” varied according to the relation in eq. (17). A two-piece 
approximation to the multiplier value H(b(n — 1) (see Fig. 8) was used 
in simulations according to the relation 
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Fig. 12—Range of R(n) as a function of L and d<(n) for an open loop control. 


A>l, if b(n—-—1)2N/2 


VA<1, if b(n—1) <N/2, (21) 


H(6(n — 1)) = 
where b(n — 1) is the number of samples in the input buffer in Fig. 9 
and Nis the size of the buffer. The value of A is greater than 1 and can 
be adjusted to control the speed at which d<(n) is allowed to vary. In 
general, the algorithm attempts to keep the buffer approximately one- 
half full (on the average). 
If the buffer becomes full or empty, an additional constraint on the 
number of bits/sample, equivalent to that of eq. (16), is imposed to 
keep the buffer from overflowing or underflowing. 


4.5 Performance of the variable rate ADPCM coder 


The operation of the variable rate coder was observed with various 
parameters. In this section, we briefly illustrate the effects of some of 
these parameters. 

Figure 13 shows a typical response of the variable rate coder for the 
sentence, “A lathe is a big tool.” The parameters of the coder were: 


N = buffer size = 1024 
L = rate distortion scale factor = 1 

A = buffer adaption parameter = 1.05 
R.. = fixed channel bit rate = 32 kb/s 
S; = input sampling rate = 8kHz 


Figure 13a shows the input speech waveform, Fig. 13b shows the 
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Fig. 13—(a) Input speech waveform. (b) Variance o*(m). (c) Transmitter buffer status. 
(d) Receiver buffer status for the variable-rate ADPCM coder. 


variance estimate of the difference signal e(n), and Figs. 13c and 13d 
show the number of samples in the transmitter and receiver buffers 
respectively. It can be noted that the receiver buffer status is the 
complement of the transmitter buffer status as discussed in Section 
3.2. 

As seen in Fig. 13, when the signal maintains a high level of activity, 
the transmitter buffer fills to capacity. When it becomes full, at time 
a (see Fig. 13c), the bit rate of the coder is limited to that of the 
channel rate to prevent overflow. At time 5, the speech activity drops 
and the buffer begins to drain out. It fluctuates with speech activity 
until a silent region is encountered at time c. At this point, the coder 
rate is again fixed to that of the channel rate to prevent underflow of 
the buffer. 
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The effects of buffer adaptation corresponding to values of A = 0.99, 
1.0, 1.025, 1.05, 1.11, and 1.2 are shown in Fig. 14b for the same sentence 
with a buffer size of N = 1024 samples. It can be seen that, when A is 
less than one, the buffer control is unstable and as A becomes larger 
(1.2) the activity of the buffer is reduced. Figure 14c shows a similar 
result for a buffer size of N = 256 samples and it can be seen that, for 
smaller buffer sizes, the buffer fills up or drains out more often. 


4.6 Recovery from channel errors 


As pointed out earlier, the synchronization of the transmitter and 
receiver in the algorithm of Fig. 9 is unaffected by channel errors in 
the data. Resistance to these types of errors can be improved with a 
robust modification of the appc step-size adaption algorithm.” 

An error in the header, however, can result in an incorrect bit 
allocation in the receiver and the loss of a 60-bit packet of data. In 
addition, the receiver buffer will receive an incorrect number of sam- 
ples resulting in an audible click and a misalignment of the transmitter 
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Fig. 14—-(a) Variance of input speech waveform. (b) Buffer status for A = 0.99, 1.0, 
1.025, 1.05, 1.1, and 1.2 (block size = 1024 samples). (c) Buffer status for A = 0.99, 1.0, 


1.025, 1.05, 1.1, and 1.2 (block size = 256 samples). 
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and receiver buffers. This misalignment results in a temporary time 
shift between the transmitter and receiver (i.e., a change in total input 
to output delay) which is not audible to the listener. 

A re-alignment of the receiver buffer will occur automatically when 
the buffer becomes full or empty, at which time a second signal error 
and time shift will occur. Two types of errors can occur, depending on 
whether the receiver buffer has an excess of samples or is missing 
samples. The first type of error is corrected when the receiver buffer 
becomes full and the second type of error is corrected when the 
receiver buffer becomes empty. In the first case, if the receiver buffer 
has more samples than it should and is driven into overflow (the 
transmitter buffer becomes empty), the excess samples can simply be 
discarded and the receiver and transmitter are again realigned. Since 
this occurs during a condition of low speech activity or silence, the loss 
of samples during this time is generally not audible. In the second case, 
when the receiver buffer is missing samples, the buffer will become 
empty prematurely during a period of high-speech activity (when the 
transmitter buffer fills up). In this case, zero-valued “dummy” samples 
can be inserted until realignment occurs between the transmitter and 
receiver. This silent period inserted during an active speech interval is 
not usually detectible by a listener. As a result, the realignment phases 
following an overflow or underflow error condition do not (in general) 
disrupt the audible speech. 

Figure 15 is an example of a simulation of error recovery. Figure 15a 
shows the speech waveform and Fig. 15b shows the receiver buffer 
status. At time a (Fig. 15b), a header error was encountered resulting 
in an excess number of bits in the buffer, indicated by the shaded 
regions. At time 0, during low speech activity, re-alignment with the 
transmitter buffer occurs. Following the error at a, no effects of the 
misalignment and realignment were audible to the listener. 


V. ADDITIONAL CONSIDERATIONS AND COMMENTS ON VARIABLE 
RATE CODING 


In this section, we examine a number of additional issues concerned 
with variable rate coding and comment on further directions and 
potential applications that need to be investigated. 


5.1 Interaction of prediction gain and rate distortion criteria 


In the coder example in Section IV (and the theoretical analysis in 
Section IIT), we have used the variance of the difference signal e(n) 
(see Fig. 10) in controlling the buffer feedback. In this section, we 
briefly show the relationship between this variance and the signal 
variance of the input signal x(n) for the case of a first-order predictor 
and demonstrate how the prediction gain interacts with the buffer 
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Fig. 15—Recovery of the buffer alignment after channel errors. (a) Speech waveform. 
(b) Receiver buffer status. 


control. The interaction between the first-order predictor gain and the 
buffer control is also examined. 
The differential signal e(7) is (see Fig. 10) 


e(n) = x(n) — ax (n). (22) 
After substituting the first-order correlation, 


_ <x(n)x(n — 1)> 
©" exn>” eo) 


the expected value of e?(n) becomes 
<e?(n)> = <x*(n) >- [1 — 2ac + a’). (24) 


The result is that the difference signal variance is equal to the input 
signal variance multiplied by a factor dependent upon the signal 
correlation. Typically, for voiced speech, c corresponds to a signal 
correlation on the order of 0.9 (depending on the sampling rate) and a 
typical value of a might be about 0.9 for a fixed predictor. The result 
is that, for voiced speech the variance <e’(n)> is approximately 
proportional to the input variance, i.e., 


<e?(n)> |voiced = 0.2 < x*(n)>. (25) 
speech 
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During unvoiced speech, the signal correlation c is on the order of 0.1 
and 


<e?(n)> | unvoiced = 1.6 < x (n)>. (26) 
speech 


A comparison of the input signal variance <x?(n)> and the difference 
signal variance <e*(n)> is shown in Figs. 16b and 16c, respectively. 
Both variances appear to track relatively closely during voiced regions. 
During the unvoiced sounds, as for example /t/ in the word “tool,” 
this loss of prediction increases the variance <e’(n)>. Since the bit 
allocation is based on <e?(n)> = o’(n), this implies that a larger 
number of bits will be used to encode these unvoiced regions where 
the prediction gain becomes low but where the input signal variance 
is still significant. 


5.2 Alternative criteria for buffer control based on code word magnitude 


Throughout this paper, we have assumed that the buffer control is 
driven by the signal variance o”(n) which is a result of the application 
of rate distortion theory. From the point of view of speech quality and 
perception, however, it is not clear that signal variance is the most 
appropriate parameter to be used for driving the bit allocation and 
buffer control.”'° Other, more perceptually meaningful parameters 
might be used as a driving function to produce better performance for 
speech. 

In this section, we allude to one alternative candidate for this driving 
function based on a short-time average of the “code word energy.” 
This function has been shown to be a sensitive indicator of speech and 
nonspeech activity.'”’® The short-time, code-word energy is defined as 


n 


E(n) = 2 q+ | elm)|, (27) 
where J is the number of samples over which the code word energy is 
averaged, c(m) is the code word at time m (see Fig. 10), and gz is a 
scale factor which normalizes the code words for different numbers of 
bits/sample. The code word c(m) is more specifically defined as the 
quantizer level (see Fig. 11) expressed as an integer. The presence of 
small-magnitude code words are associated with silence, and the 
presence of large-magnitude code words are associated with speech. 

Figure 16d illustrates an example of the short-time code word energy, 
E(n), for parameters J = 80, and g2 = q3 = qa = Gs = 1. It can be seen 
that £(n) provides a more sensitive indication of speech activity than 
<x’(n)> or <e”(n)> and therefore may be a perceptually more desir- 
able driving function to use for bit allocation and buffer control. It also 
provides a more reliable indication of when silence occurs in the 
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Fig. 16—(a) Input speech waveform. (b) input signal variance <x*(n)>. (c) Difference 
signal variance <e?(n)>. (d) Short-time code word energy E(n). (e) Speech waveform 
with silence decision based on code word energy. 
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sentence.'”’® For example, Fig. 16e shows the speech waveform x(n) 
with the silence regions set to zero. It was not possible to distinguish 
between this sentence and the original in Fig. 16a when listening. This 
silence detector feature may be useful, particularly for a multiple-user 
application where some users can be turned off during silence. 


5.3 Multiple user (TASI) applications of variable rate coding 


In Section II, we illustrated a block approach for implementing 
variable-rate coding with multiple users and have pointed out that 
TASI-type advantages can be gained with this approach. A similar 
approach can also be implemented using dynamic buffering for each 
speaker. This idea is intuitively appealing in the sense that it couples 
TASI advantages with variable-rate coding advantages, i.e., it is a TASI 
with memory. By buffering the inputs of the speakers, bursts of strong 
activity from some speakers can be time-aligned with micro-silence 
regions of other speakers. Speakers whose buffers are full can receive 
short-time priority over other speakers whose buffers are not full. 
Thus, the statistics of speech activity seen by the channel is a combi- 
nation of activity over time as well as across speakers. Flexible tradeoffs 
should be possible between the size of the input buffers (i.e., time 
delay) and the number of allowed users in the system: 


VI. CONCLUSIONS 
In summary, we can draw a number of conclusions concerning 
variable-rate coding: 


(t) A block processing analysis shows that, for a single user, the 
improvements in block s/n of a variable-rate coder over that of 
a fixed-rate coder are dependent on the nonstationarity of the 
source and are related to the ratio of the arithmetic-to-geomet- 
ric means of the signal variance. 

(vt) For a single speech source, block sizes greater than about 100 
ms are required before any substantial improvement over fixed- 
rate coding can be realized. Alternatively, flexibility in trans- 
mission rate is obtainable with very short block sizes with no 
loss in performance over fixed rate coding. 

(zzz) A multiple user variable-rate coding offers an interesting ap- 
proach to implementing a TASI system. 

(tv) Practical methods exist for designing variable-rate coders, and 
they can be made to be robust to channel errors. 
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In a recently proposed communication system, there would be 
tandem connections of 16 kb/s delta modulators and 2.4 kb/s vocod- 
ers. Preliminary work has indicated that such tandem links would be 
of substantially lower quality than etther the delta modulator link or 
the vocoder link alone. The present study, which includes an elabo- 
rate subjective speech quality experiment, confirms this preliminary 
conclusion. It also shows that two other differential waveform coders 
are no better than the proposed delta modulator in tandem links. On 
the other hand, a 5-band sub-band coder does offer substantially 
higher quality than the delta modulator. Still, its performance in 
tandem with the vocoder is poorer than that of the vocoder or the 
sub-band coder alone and is probably of only marginal value for 
practical communication. We have obtained several objective mea- 
sures of speech quality which, for the most part, show relatively little 
correlation with subjective quality. The most successful objective 
predictor of subjective ratings is a linear combination of linear 
predictive coding distances. 


I. BACKGROUND AND INTRODUCTION 
1.1 Background 


Recent plans for United States government digital communication 
networks have focused attention on the compatibility of 2.4-kb/s 
(narrowband) vocoder systems and 16-kb/s (wideband) waveform cod- 
ing schemes. An important question arises in the implementation of 
such a system: If both narrowband and wideband systems are designed 
to provide adequate speech communication individually, will a tandem 
connection of them also function adequately? 
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A recent study of this question,” * using signal-to-noise ratio (s/n) 
and a spectral distance measure as criteria of merit, has cast doubt on 
the viability of circuits containing a 2.4-kb/s LPc (linear predictive 
coding) vocoder and a 16-kb/s Cvsp (continuously variable slope delta 
modulation) waveform coder. In that study, it appeared that CvsD was 
the weak link in these tandem connections. However, the conclusions 
could only be regarded as tentative because the speech material 
included in the study was very limited and because the relationship of 
the objective performance measures to the quality of communication 
experienced by human users was by no means evident. 


1.2 Aims 


In the work reported here, we extend previous results by: 

(i) Studying three 16-kb/s waveform coders in addition to Cvsp. 

(iz) Presenting subjective as well as objective performance mea- 
sures. 

(uz) Greatly enlarging the variety of speech material processed by 
the various communication systems. 

The specific questions addressed in our study are: 

(zt) What is the subjective quality of tandem connections of nar- 
rowband and wideband systems, relative to the quality of 
individual systems? 

(uz) Are there alternatives to cvsp that offer higher quality in either 
(or both) individual or tandem performance? 

(zit) What is the relationship of objective measures of system per- 
formance to subjective assessment of speech quality? 


71.3 An experiment 


To answer these questions we produced, in software on a Data 
General Eclipse computer, a 2.4-kb/s LPc vocoder and four different 
16-kb/s waveform encoders. They are: 

(x) The cvsp studied in Refs. 1 and 2. 

(zz) A double integration version of Cvsp, which we call ADM (adap- 

tive delta modulator). 

(zit) A two-bit 8 kHz appcM (adaptive differential PCM). 

(tv) sBC (sub-band coding) with five separate channels spanning 

the 200 to 3200-Hz band of speech energy. 
Relative to CvSD, ADM has virtually the same circuit complexity 
(requiring only one additional resistor and one capacitor), ADPCM is 
perhaps 2 to 3 times as complex, and SBC is approximately 10 to 20 
times as complicated. 

The five coding schemes (four waveform coders and LPC) were used 
in 13 different communication systems (five coders individually, four 
waveform coders preceding LPC, four waveform coders following LPc). 
These systems processed a total of 148 speech samples from four 
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talkers (two male and two female) at three different power levels 
(spanning a 30-dB range). 

Twenty-two subjects rated each of the processed speech samples on 
a 9-point scale. Kach sample consisted of one sentence of 2 to 3 seconds 
duration, and no sentence was heard more than once by any individual 
subject. Although the subjects were asked to rate overall speech 
quality, it is felt that intelligibility had a greater influence over their 
ratings than it does in experiments in which a few sentences are 
repeated many times. 

In addition, four different objective measures of system aualieys were 
calculated. These include the s/n and spectral distance measure used 
in Refs. 1 and 2, and also two segmental signal-to-noise ratios’ that 
have been shown in other work to be more closely related to subjective 
quality than s/n.* 

Statistical analysis of the subjective data reveal a complicated 
pattern of interactions among the experimental variables. The relative 
performances of the various coding schemes are dependent in a com- 
plicated way on talker and on input level as well as on (individual or 
tandem) system configuration. In spite of the complicated dependence 
of subjective quality on physical conditions, clear patterns in the data 
emerge to answer our original questions. 

Among the individual circuits, ssc has on the average the highest 
subjective quality, followed by LPc, CVvSD, ADM, and ADPCM, in that 
order. Tandem connections all are substantially degraded relative to 
individual circuits. Among the waveform coders, SBC provides the best 
tandem connections with Lpc, but the sBc-Lpc tandems are substan- 
tially worse than either individual system. The tandems involving the 
other waveform coders are probably inadequate for effective speech 
communication. 

Among the objective measures, s/n as in other studies” * was found 
to be very poorly correlated with subjective quality. Moreover, with 
the diversity of circuit conditions and speech material presented here, 
the segmental signal-to-noise ratios were also of little use in predicting 
subjective quality. The spectral distance measure was the only one 
that was somewhat useful: a linear regression model based on distance 
measures of both tandem links and on overall distance accounted for 
60 percent of the variance in the average ratings. 


li. SYSTEM DESCRIPTION 
2.1 Overview 

In the narrowband-to-wideband tandem link shown in Fig. 1, the 
input speech appears as 16-bit PCM with 8-kHz sampling rate. It is first 
bandpass filtered to a bandwidth of 200 to 3200 Hz by means of a 6th 
order elliptic bandpass filter. It is then vocoded by the LPc vocoder. At 
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the output of the vocoder, the sampling rate is converted (if necessary) 
by digital techniques’ to the sampling rate of the waveform coder. This 
conversion has no effect on the tandem connection and is virtually 
“transparent” in terms of quality. The gains G and 1/G before and 
after the coder are used in measuring the dynamic range (i.e., variations 
in performance as a function of signal level) of the waveform coder. 
The output of the coder is lowpass filtered to 3200 Hz, and its sampling 
rate is converted back to 8 kHz and the output signal is processed by 
a 3200-Hz lowpass filter. In Fig. 2, the same signal processing operations 
are shown with the ordering that provides a wideband-to-narrowband 
connection. 


2.2 The narrowband system (LPC) 


The narrowband system consists of a linear predictive coding 
(LPC) system based on an all-pole model of the speech production 
mechanism. The all-pole model implies that, within a frame of speech, 
the output speech sequence is approximated by 


Pp 
Sn = > QARSn-—k + G'Un, (1) 
k=] 
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Fig. 1—Narrowband-to-wideband system. 
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where p is the number of poles, u, is the appropriate input, G’ is the 
gain, and the a;,’s are the LPC coefficients that represent the spectral 
characteristics of the speech frame. For a voiced speech segment, u,, is 
a sequence of pulses separated by the pitch period. If the segment is 
unvoiced, pseudorandom white noise is used as input. 

In our study, the Lpc coefficients were calculated by the autocorre- 
lation method with p = 12. The analysis was performed every 20 ms 
(50 times/s) with a variable analysis frame size. The frame size was 
proportional to a running average of the pitch period as obtained at 
the pitch detector output.° A Hamming window was used prior to the 
LPC analysis. Pitch detection and voiced-unvoiced (V/U) analysis were 
done using the modified autocorrelation method. 

For quantization purposes, the LPc coefficients were converted to 
log area ratio coefficients, which were coded by means of ADPCM 
techniques.” An overall bit rate of 2.4 kb/s was obtained by allocating 
48 bits to each of the 50 frames per second. Details of the encoding 
scheme are given in the references. 


2.3 Delta modulators, CVSD and ADM 


The experiment includes two delta modulators, CvsD, and a double 
integration version of cvsD which we refer to as ADM. Both of them 
can be represented by the block diagram in Fig. 3. Their principal 
difference is in the nature of the signal feedback path which is a single 
integrator in CvsD and a double integrator in ADM. 

In both coders, the step size voltage can be generated by an RC 
integrator as described in Ref. 1. The integrator input depends on the 
three most recent output bits. If they are identical, the step size 
increases; otherwise, it decreases. The adaptation equation is 


Ar+i = BA + (1 — B)(Ve + Amin), (2) 


where 
A; is the step size at the kth sampling instant, 


OVERLOAD 


DETECTOR 





Fig. 3—Block diagram of the cvsp and ADM coders. 
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B = 0.99 is the step size leakage constant corresponding to an RC 
time constant of 6.4 ms, 
Amin is the minimum step size, and 
Vi = Amax When the three most recent outputs are identical and 
V, = 0 otherwise. 
The dynamic range of the coder is determined by Amax/Amin, which is 
150 (44 dB) in the cvsp and 256 (48 dB) in the ApM. 


2.3.1 CVSD 


As in Ref. 1, the signal feedback loop is a single integrator with a 1- 
ms time constant. The difference equation is 


Vr+i = 1 + AH (1—a1)b;zAz, (3) 


where 
yp Is the integrator output at the kth sampling instant, 
a; = 0.94 is the integrator leakage constant, corresponding to an RC 
time constant of 1 ms, 
H = 3 is the integrator gain, and 
b,; = +1 is the Ath output bit. 
In the cvsp, Amax = 2 dBm, which places the center of the coder 
dynamic range near —21 dBm, the central value of the three signal 
input levels used in the experiment. 


2.3.2 ADM 
The double integration version of CvsD was selected for evaluation 
after a large number of other delta modulators were simulated. The 
other delta modulators differed from CVvsD in one or more of the 
following respects: 
(zt) An exponential expandor was used in the step-size feedback 
loop to produce the step size 


Ane = exp (Ar+1), 


where Ag+; is given by (2). This changes the adaptation from 
essentially additive to essentially multiplicative. 

(tt) The most recent two bits rather than the most recent three bits 
were used to determine whether the step size would increase or 
decrease. 

(iit) The signal feedback loop contained a double integrator rather 
than a single integrator. 

A limited amount of speech material was processed to evaluate these 
modifications. Segmental s/n was measured for each delta modulator 
configuration and, although some modifications resulted in better 
performance than cvsp for certain input levels, none of them produced 
substantially better results either in terms of dynamic range or peak 
segmental s/n. However, to provide one delta modulation alternative 
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to CVSD, we chose the double integration ADM. Double integration is 
known to enhance performance significantly at higher sampling rates 
and to be essentially ineffective in 8-kHz DPpcm (see Section 2.4). Our 
purpose here was to assess the effectiveness of a particular double 
integrator in a coder with 16-kHz sampling. 

The double integrator in this coder is a second-order FIR filter that 
conforms to the block diagram in Fig. 4. The difference equations of 
the integrator are 


Un+1 = Vr + H(1 — a — a2) b,Az (4) 
Vee = QyUgs+i + AU, (5) 


where 
u, is the decoder output, 
yp is the output of the encoder feedback loop, 
a, = 1.38, a2 = —0.43 are the filter coefficients, and 
H = 10 is the gain. 
The z-transform of the integrator is 


a2 1(1 — ¢327') 
(1 — e127") (1 — c227')’ 


(6) 


where the integrator poles are related to the filter coefficients by 





Cr+ C2= a CiC2 = —aQ2 (7) 
and the zero is 
C3 = —a2/a). (8) 
The corresponding real poles and zero of the filter frequency response 
are 
i= — cos"! a “| (9) 
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Fig. 4—Double integrator circuit for the ADM coder. 
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where T = 1/16000 s in a 16-kb/s delta modulator. With ai = a2 = 1.38, 
—0.43, the pole frequencies are 200 and 2000 Hz, and the zero is at 3500 
Hz, so that the integrator frequency response is approximately that 
shown in Fig. 5. In the ADM, Amax = —5 dBm, which approximately 
centers the coder dynamic range at —21 dBm. 


2.4 ADPCM 

Figure 6 shows the block diagram of the ADPCM system. In an error- 
free environment, the primed quantities at the receiver are equal to 
the corresponding ones at the transmitter. In the encoder error sample 
e(k) is generated as the difference between the input speech sample 
s(k) and a predicted sample s(k). After quantization with 2 bits/ 
sample, the prediction error at both receiver and transmitter is added 
to the predicted sample to give the reconstructed sample r(k). The 
predicted sample s(k) is derived from the previous reconstructed one, 
r(k—-1), by a first-order transversal predictor: 


S(k) = 0.78r(k — 1). (10) 


The coefficient 0.78 was computed by the usual mean-square error 
minimization technique” under the hypothesis of an overall s/n of 
about 10 dB. 

The 2-bit coding of the prediction error is effected by means of the 
adaptive step size A(k), which is computed as proportional to a short- 
time estimate o(k) of the absolute magnitude of the quantizer input. 
The estimate o(k) is computed recursively from the decoded prediction 
error d(k) so that no side information has to be transmitted. The 
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Fig. 5—Frequency response of the double integrator circuit. 
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Fig. 6—Block diagram of the ADPCM coder. 
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entire adaptation process is summarized by the following two equa- 
tions: 


A(k) = Co(k) (11) 
o(k) = ao(k — 1) + (l—a) | d(k—-1)|. (12) 


In eq. (11), the parameter C, the load-constant, determines in the 
steady state the magnitude of the average step-size and hence the 
amount of granular noise and overload distortion. In eq. (12), the 
parameter a determines the speed of response of the adaptation 
algorithm to input level changes: a relatively small value of a produces 
fast response but an inaccurate estimate in steady state. 

For simulating a practical Implementation, the step size A(k) was 
constrained to assume values in the range (Amin, Amax) with 


Avnax 


min 


= 256. (13) 


Values of C, a, and Amin were calculated by optimizing a prediction of 
the subjective opinion score, obtained from separate measures of 
granular noise and overload distortion. The integrator coefficient was 
a = 0.875, corresponding to a time constant of 1 ms. The minimum 
step size which produced the same degradations at the high and low 
end of the input level range of interest was found to be —63 dBm. The 
maximum step size is —15 dBm, while the rms speech input level 
assumes values in the range —36 dBm to —6 dBm. 
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2.5 The sub-band coder 


Sub-band coding is a waveform coding technique in which the 
speech band is partitioned into typically 4 or 5 sub-bands by bandpass 
filters. Each sub-band is then lowpass-translated to dc, sampled at its 
Nyquist rate, and then digitally encoded using adaptive Pcm (APCM) 
encoding. By this process of dividing the speech band into sub-bands, 
each sub-band can be preferentially encoded according to perceptual 
criteria for that band. On reconstruction, sub-band signals are decoded 
and bandpass-translated back to their original bands. They are then 
summed to give a replica of the original speech signal. 

A particularly attractive implementation of the sub-band coder in 
terms of hardware is based on an integer band sampling approach.’~” 
With this approach, the modulations to lowpass at the transmitter and 
to bandpass at the receiver are inherent in the sampling process. The 
implementation is illustrated in Fig. 7. Bandpass filters BP, to BPny in 
the transmitter and receiver serve to partition the input speech into N 
sub-bands. The coders and decoders encode the sub-band signals and 
the multiplexer combines these digital signals and synchronizing bits 
into a single bit stream for transmission over the digital channel. 

Table I shows the choice of bands and bit allocations used in the 16 
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Fig. 7—Block diagram of the sub-band coder. 


Table |I—16 kb/s 5-band sub-band coder 


Band Sampling Min. 

Edges Freq Step-size Bit 
Band (Hz) (Hz) (dB) Allocation Kb/s 
1 178-356 356 (Ref) 4 1.42 
2 296-593 593 0 4 2.37 
3 533-1067 1067 0 3 3.20 
4 1067-2133 2133 —3 2 4.27 
5 2133-3200 2133 ~§8 2 4,27 
Sync 0.47 
16.00 
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kb/s coder. The coder is a 5-band design which was proposed in Ref. 
11. Column 2 shows the frequency range covered by each sub-band. 
The bit allocation refers to the number of bits/sample used by the 
coders in each sub-band. As seen from the table, more accuracy is 
allowed for encoding the lower bands for reasons explained in Ref. 11. 

The frequency range of the coder extends from 200 to 3200 Hz. A 
plot of the frequency response, shown in Fig. 8, reveals small notches 
at 1067 and 2133 Hz. These notches are due to the transition bands of 
the filters in bands 4 and 5. Subjectively, they are not very perceptible. 
Bands 1 to 3 are overlapped to avoid such notches at lower frequencies. 
The filters are sharp cutoff, 200 tap, FIR filters. 

Column 4 in Table I contains the minimum step sizes of the APCM 
coders, expressed in decibels, relative to the minimum step size of band 
1. This choice of minimum step sizes is different than that suggested 
in Ref. 11 and was found to give a better matching of the dynamic 
ranges of the sub-bands. 


ll. OBJECTIVE MEASUREMENTS 


Four different objective measurements were made on the waveform 
coders. They are conventional signal-to-noise ratio, SNR, two types of 
segmental signal-to-noise ratios, SEG1 and SEG2, and an LPC spectral 
distance measure, D. In addition, D was used to evaluate the perform- 
ance of the LPc vocoder and the tandem connections of the waveform 
coders and the LPc vocoder. In this section we briefly define each of 
these objective measures. 
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Fig. 8—Frequency response of the sub-band coder. 
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3.1 Conventional SNR 


The most commonly used measure of performance of digital coders 
has been the conventional signal-to-noise ratio evaluated over an 
utterance of speech. The speech power is defined as 


p=>x*(m), (14) 


and the noise power is defined as 


n= ¥ (x(m) — y(m))’, (15) 


where x(m) and y(m) are the input and output signals of the coder, 
respectively, and the summations in (14) and (15) are taken over the 
entire speech utterance. The conventional s/n is then defined as 


SNR = 10 log(p/n). (16) 


In measuring the input and output signals of the coders, it is generally 
desirable to compensate for the effects of lowpass or bandpass filtering. 
This is done by the arrangement shown in Fig. 9. The input speech 
signal s(m) is coded to form the output speech signal y(m). It is also 
filtered with the same filters used in the coder to generate a compen- 
sated reference signal x(m) which is used as the input signal in (14) 
and (15). SNR is, therefore, strictly a measure of coder distortions and 
is not affected by bandlimiting or group delay in the filters. 


3.2 SEG! 


While snr is the most widely used criterion in measuring coder 
distortion, it has also long been known that in many situations it does 
not correlate well with subjective performance.’ Another definition of 
signal-to-noise ratio, however, recently proposed by Noll,® does appear 
to correlate better with subjective performance. This measure is based 
on s/n measurements made over segments of speech which are typi- 
cally about 20 ms in duration. An average over all of the segments in 
the speech utterance is then taken to obtain a composite measure of 


CODER 
WITH FILTERS | yim) 


CODER 
FILTERS 
(NO CODING) 





INPUT 
SPEECH 







S/N 
MEASURES 







x(m) 


Fig. 9—Circuit for measuring signal-to-noise ratios. 
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performance for the entire utterance. If (s/n); corresponds to the 
signal-to-noise ratio in decibels for a segment, 1 [computed in the same 
manner as in (16) ], the segmental s/n (SEG1) is then defined as 


N 
SEG1 ue y; (s/n); (dB), (17) 
N ix 
where it is assumed that there are N 20 ms segments in the speech 
utterance. 

Problems arise in this definition of segmental s/n when intervals of 
silence exist in the speech utterance. In segments where the input 
signal x(7) is essentially zero, any slight noise will give rise to large 
negative (s/n);, and these segments may unduly dominate the average 
in (17). To prevent this anomaly, we first identify those segments 
which correspond to silence and exclude them from the average in 
(17). This is achieved by means of a simple threshold. Let p,; represent 
the (average) speech energy in a segment, 2, so that 


1 K 
Di =F y x*(m), (18) 


where K corresponds to the number of speech samples in the segment. 
Then the segment will be included in the computation of SEG1 in (17) 
if its energy exceeds a threshold T,, 1.e., if p; > T. If it does not exceed 
this threshold, it is not included in the average in (17). Furthermore, 
to prevent any one segment from dominating the average we also limit 
the value of (s/n); to a range of —10 to +80 dB. That is, —10 < (s/n); 
< 80 dB. In computer simulations, the 16-bit wordlength admitted 
signal levels in the range +32767 and we set T to 900, corresponding to 
—55 dBm. 


3.3 SEG2 


The definition of this measure is 
1 N 
SEG2 = N Y. 10 logio(1 + pi/ni) (dB), (19) 
i=] 


where p; is the signal power in segment 7 and 77; is the noise power in 
segment i. They are defined (on segments) according to eqs. (14) and 
(15), respectively. 

Unlike sEG1, SEG2 does not have any thresholds. It is self-limiting to 
a lower value of 0 dB due to the addition of the constant 1 inside the 
logarithm. As in the SEG1 measure, SEG2 uses 20 ms segments. 


3.4 LPC distance measure 


The fourth objective measure was the LPC distance proposed by 
Itakura."* The distance between two frames of speech with LPc coef- 
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ficient vectors a and a, and with autocorrelation matrices V and V is 
defined as 


aVa | (20) 


D, = d(a, a) = log Re 
where a and 4, are (p + 1) component vectors and V and V are (p + 
1) X (p + 1) matrices, where p is the order of the LPc analysis. 
D, is a measure of the distance between frames of speech. This 
distance, however, does not satisfy exactly all the properties of a true 
distance metric, i.e., 


d(a, a) # d(a, a). (21) 


However, for cases when d(a, &) 1s small, the inequality of eq. (21) is 
almost an equality. To compensate for this lack of symmetry, it is 
convenient to define a second distance, Dz, as 


ava’ 
Dz = d(a, a) = log | —;> 22 
2 = d(a, a) °|Re (22) 
and an average distance between the two frames is now given by 


D, + D2 


D= 5 


(23) 
Equation (23) defines a true distance metric which can be used to 
measure the distance (dissimilarity) between two frames of speech. It 
can readily be shown” that either D; or Dz can be expressed in terms 
of spectral differences between the LPc models for the two frames of 
speech. 

D, and D2 were measured for every utterance used in the tests to be 
described later. They were measured on a frame-by-frame basis and 
averaged across the entire utterance to give an overall LPc distance 
between the original and processed version of a sentence. Figure 10 
shows the system used to measure LPC distance for a single coder. The 
box labeled delay was used to compensate any delay inherent in the 
coder, and the bandpass filters were used both to eliminate out-of- 


200-3200 
BPF 
200-3200 
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Fig. 10—Circuit for measuring LPC distance measures. 
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band quantization noise generated in the coder and to guarantee that 
the bandwidths of both the original and coded utterances were the 
same. 


3.5 Comparison of SEG and D 


Figures 11 and 12 show a series of plots for two of the utterances 
used in the experiments (encoded with the ADPpcM coder). Part a of 
each figure shows the rms energy of the utterance as a function of 
time (frame number), part b of each figure shows the segmental s/n 
versus frame number, and part c of each figure shows the LPc distances 


5 


| Manel 


ENERGY (RMS) X 103 


SNR SEG IN DECIBELS 


DISTANCE 





50 


FRAME NUMBER 


Fig. 11—Objective measurements as a function of time for utterance A. (a) rms energy 
of the input signal. (b) Segmental s/n-sEa1. (c) LPc distance. 
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Fig. 12—Objective measurements as a function of time for utterance B. (a) rms energy 
of the input signal. (b) Segmental s/n-szG@1. (c) LPc distance. 


(both D; and D2) versus time. The utterance of Fig. 11 had an average 
Lec distance of about 0.60, whereas the utterance of Fig. 12 had an 
average LPC distance of 0.97. It can be seen in both figures that most 
of the time D, ~ D2; however when either D: or D2 was large, the 
differences between D, and D2 were often significant. It can also be 
seen in these figures that the Lpc distance and the segmental s/n do 
not measure similar types of distortion—i.e., when the segmental s/n 
is small (indicating temporal distortion of the waveform) the LPC 
distance is not necessarily large (indicating spectral distortion of the 
signal). Finally, it can be seen that a large degree of variation (on a 
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frame-by-frame basis) occurs with both the segmental s/n and the LPc 
distance. Thus, a single number which characterizes the “distortion” 
across the entire utterance may have little meaning 1n many cases. 


IV. EXPERIMENTAL DESIGN 
4.1 Circuit conditions 


The experiment tested 37 different communication circuits, each 
characterized by three parameters: direction, coder, and level. There 
were three directions: (z) single link with a waveform coder or vocoder 
alone, (ii), LPC-to-waveform, as in Fig. 1, (tt) waveform-to-LPC as in 
Fig. 2. There were five different single links, four with waveform coders 
and one with a vocoder. Each waveform coder was tested with speech 
at three different input levels, —36 dBm, —21 dBm, and —6 dBm. The 
corresponding settings of the gain parameter, G, were 0.178, 1.00 and 
5.62, respectively. The speech level at the vocoder input was always 
—21 dBm. Thus, there were 13 single link configurations, in all. Each 
of the other two directions had 12 circuit configurations, comprising 
all combinations of four waveform coders and three input levels. 


4,2 Speech material 


For this experiment, a substantial digital speech hbrary was pre- 
pared. Four talkers, two male and two female, read 40 different 
sentences, 2 to 3 seconds long, each talker reading from a different 
phonetically balanced list. The talkers were seated in a sound-proof 
booth and spoke into a high-quality dynamic microphone. The ampli- 
fied microphone signal was lowpass-filtered at 3.2 kHz, sampled and 
converted into digital form by a 16-bit A/D converter operating at 8- 
kHz sampling frequency and finally written onto a magnetic disk. All 
the sentences were digitally equalized to the mean power level of —21 
dBm. 

For each of the 37 circuit conditions, sentences spoken by each of 
the four talkers were processed, generating a total of 148 stimulli. 
Different sentences were used in each case so that in the set of 148 
stimuli the same sentence was never heard twice. With this format, we 
speculate that intelligibility of the processed speech played an impor- 
tant role in determining quality judgments. In tests containing a few 
sentences, presented many times, each sentence becomes recognizable 
to subjects even in conditions severe enough to make it quite unintel- 
ligible at first hearing. It is our hypothesis that, in such tests, there is 
a lower correlation between intelligibility and subjective quality than 
in the tests reported here. 


4.3 Procedure 


The 148 stimuli were recorded in different random orders on 4 
analog tapes. Twenty-two students from the junior and senior classes 
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of local high schools served as paid subjects. They listened to the 
processed speech monaurally over Pioneer SE 700 earphones at 80 dB 
SPL while seated in a double-walled sound booth with frequency- 
weighted room noise introduced at a level of 50 dBA. The total 
listening time for each group of subjects was about 30 minutes, with.a 
short break after the 80th sentence. After each stimulus, the subjects 
had 4 seconds for recording their judgments. They were instructed to 
rate the quality of the stimulus by checking on their answer sheets a 
value between 1 and 9, using 1 for the worst conditions, 9 for the best 
ones, and intermediate numbers for intermediate qualities. Before the 
actual test, the subjects listened to 12 practice sentences, different 
from those used in the experiment, spanning the range of quality in 
the experiment. 


V. SPEECH QUALITY RESULTS 


Variability in the subjective and objective measures of quality of the 
148 processed speech samples can be attributed to several (variable) 
sources, namely: 

(t) The “direction” of the circuit: LPc-to-waveform coder, wave- 
form-coder-to-LPC, or single link. 
(it) The waveform coder. 
(zzz) The speech level at the input to the coder. 
(tv) The talker. 
(v) The sentence. 
(ut) The listener (subjective data only). 
(vii) Inconsistency of each listener (subjective data only). 
We are primarily interested in how the first two variables, circuit 
direction and waveform coder, influence quality. Inferences about 
these variables would be simple if they accounted for most of the 
variance in the data or if they did not interact substantially with the 
other variables. Unfortunately, neither of these conditions is met by 
our data, and many of our inferences about circuit direction and 
waveform coder will be more qualified than we would like them to be. 


5.1 Subjective data 
5.1.1 Listeners 


The amount of listener agreement was fairly low relative to other 
speech quality experiments.*”’ For each pair of listeners, we computed 
the correlation coefficient of the 148 ratings. The median of the 
correlations was only 0.49. The 25th and 75th percentiles were 0.41 
and 0.60, respectively. With respect to the 148 mean ratings (averaged 
over the 22 listeners), individual listener correlations ranged from 0.50 
to 0.85, which suggests that no subject was very idiosyncratic in his 
ranking of stimulus conditions. 


618 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


Figure 13 gives plots of the rating scores of each of the 22 subjects 
for the Lpc system alone and for each of the four talkers. The large 
variability among subjects is readily seen. For example, for talker 3 
the average rating was 7.3. However, two subjects gave this circuit 
rating of 1 (the lowest possible), whereas 13 subjects gave it a rating of 
8 or 9 (the highest possible). Similar variability was found in the scores 
for almost every test condition. 

The 148 listener averages are presented in Table I], where we also 
provide aggregates of these averages across input level and talker. The 
aggregated mean values show the overall effects of circuit direction 
and waveform coder. 


5.1.2 Sentences 


In many subjective testing experiments, listeners hear one or a few 
sentences repeatedly. To achieve closer conformity to practical com- 
munication situations, a different sentence for each stimulus condition 
was used. A disadvantage of this design is the lack of any control for 
or means of testing the effect of sentence content on the quality 
measures. The variability due to sentences appears in and enhances 
the experimental error; 1.e., the variance that cannot be accounted for 
in statistical analyses. 


TALKER 1, MALE TALKER 2, FEMALE 


RATING 





1 10 20 30 1 10 20 30 
SUBJECT NUMBER SUBJECT NUMBER 


(a) (b) 


TALKER 3, MALE TALKER 4, FEMALE 


RATING 





1 10 20 30 1 10 20 30 
SUBJECT NUMBER SUBJECT NUMBER 


(c) (d) 


Fig. 13—Rating scores as a function of subject for the individual LPc circuit for each 
of the four talkers. 
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5.1.3 Talker effects 


Averaged over all 37 circuit conditions (combinations of input level, 
coder, and direction), the ratings of speech from the two male talkers 
were 4.98 and 4.94. The averages for the two females were 3.99 and 
4.00. A three-way analysis of variance (listener by talker by circuit 
condition) revealed a very significant talker effect. Clearly, this effect 
is predominantly due to listeners giving lower ratings to distorted 
female speech than to distorted male speech. However, there is also a 
substantial talker-circuit interaction, indicating that differences in 
ratings of male and female speech are by no means uniform across 
experimental conditions. (In fact, with fairly low distortion as in sub- 
band coding in a single link, the male and female averages are virtually 
the same—7.13 and 7.20, respectively.) This nonuniformity is evident 
in Table II which also reveals that, although the overall ratings of the 
two males are virtually identical, there are substantial differences from 
condition to condition in the ratings of the male voices, and likewise 
for the female voices. 


5.1.4 Input level 


The step sizes of the waveform coders were adaptable over a range 
of 44 dB (for the cvsp) or 48 dB (for the other three coders). With the 
rms input level varying over a range of 30 dB and individual sounds 
within a sentence exhibiting a wide range of levels, the weak sounds of 
the low-level signals were subject to greater-than-average granular 
quantizing noise, while the strong sounds of the high level sentences 
were susceptible to overload. The maximum and minimum step sizes 
of each waveform coder were chosen with the aim of centering the 
dynamic range of subjective quality in the —15 to +15 dB range of 
input levels. 

Table III shows that this design effort was entirely successful with 
the CvSD and ADPCM coders in which the dynamic range of subjective 
performance is exactly symmetric around the 0-dB input level. In the 
SBC and ADM coders, the overload distortion of the +15 dB input level 
was less harmful subjectively than the granular noise produced with 
the input set at —15 dB. In these coders, a better balance of granularity 
and overload would have been achieved with lower minimum step 
sizes. 


5.1.5 Coder and direction 


We have used the Tukey HSD criterion” to evaluate the relative 
merits of the 13 communication system configurations listed in Fig. 14. 
Figure 14a shows, for each circuit direction, groupings of coders for 
which the null hypothesis cannot be rejected at the 0.05 level. In all 
cases, SBC is superior to any of the other waveform coders. In the Lpc 
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Table Il—Average subjective ratings (over 22 listeners) 
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Table Ill—Average subjective ratings 
(over listeners, talkers and direction) 


Coder 
SBC CVSD ADPCM ADM 
—15 dB 5.0 4.1 3.9 3.6 
Level 0dB 5.5 4.1 4.3 4.3 
+15 dB 6.0 4.1 3.9 4.4 


SINGLE—-LINK LPC—WAVEFORM WAVEFORM—LPC 
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< cvsp > CVSD 
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LPC-WAVEFORM LPC—WAVEFORM WAVEFORM—LPC LPC—WAVEFORM 
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LPC—-WAVEFORM 





WAVEFORM—LPC WAVEFORM—LPC 


(b) 


Fig. 14—Relative subjective quality of coding systems. Circles indicate that it is 
impossible to reject the hypothesis that the coders have the same quality. 


— waveform circuits, ADM, CVSD, and ADPCM have essentially the same 
performance. In the waveform — LPc direction, ADPCM is better than 
ADM and CvSD, which exhibit essentially the same quality. 

Figure 14b shows the equivalent groupings across direction. The 
salient inferences from these groupings is that, for each waveform 
coder, the single link substantially outperforms either of the tandem 
connections. The two tandem directions have essentially the same 
quality when SBC, CVSD, or ADM is the waveform coder. The ADPCM 
— LPC tandem is significantly better than the Lpc — ADPCM tandem. 


5.2 Objective measurements of quality 


Results of the objective measurements discussed in Section III are 
presented in Tables IV and V. Table IV gives results for the perform- 
ance of the single-link circuits in terms of SNR, SEG1, SEG2, and LPC 


622 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


Table !\V—-Objective measurements of single link coders 


Talker: M1 Talker: F1 
Level Coder SNR SEG1 SEG2 D_ Level Coder SNR SEG1 SEG2 D 
-15  sBC 13.2 11.9 8.7 0.82 -—-15  sBC 17.7 153 12.3 0.53 
0 SBC 14.4 13.4 8.8 0.50 O SBC 14.4 13.3 10.5 0.51 
+15 SBC 13.8 10.2 8.6 0.80 +15 SBC 13.2 14.9 12.0 0.34 
—15 cVvSsD 12.1 10.2 9.1 065 -15 cvsp 148 12.1 11.6 0.84 
0 cVSD 8.4 121 11.0 0.54 0 cVSsD 17.5 15.2 13.8 0.65 
+15 cVSsD 3.5 8.3 9.8 0.54 +15 cvsp 8.2 11.8 12.3 0.55 
—-15 appcm 126 128 103 0.54 -15 appcm 162 162 139 0.61 
O appcm 10.1 412.3 12.0 0.34 0 appcm 16.5 14.0 139 0.50 
+15 ADPCM 36 106 12.0 0.37 +15 ADPCM 8.2 126 13.1 0.58 
—-15 ADM 14.6 10.5 9.5 0.64 -15 ADM 16.5 14.7 12.6 0.89 
0 ADM 12.3 13.2 11.1 0.47 0 ADM 18.4 16.5 149 0.70 
+15 ADM 4.7 10.7 12.2 0.47 +15 apm 10.4 12.7 13.0 0.48 
— LPC 0.31 — LPC 0.38 
Talker: M2 Talker: F2 
Level Coder SNR SEG] SEG2 D_ Level Coder SNR SEGI1 SEG2 D 
—-15 sBC 12.3 10.6 7.66 0.76 -—15 SBC 146 14.1 12.0 0.60 
0 SBC 13.8 145 108 0.388 0 SBC 13.4 141 115 0.33 
+15 SBC 12.3 14.1 11.1 0.27 +15 SBC 12.2 138 12.1 0.28 
—15 cvsD 12.0 9.9 8.1 081 -—-15 cvsp 12.8 11.5 10.0 0.74 
~0 cVSD 9.4 9.6 9.4 0.43 0 cVvsD 11.1 149 12.5 0.62 
+15 cVvSD 4.4 99 11.5 0.43 +15 #£cVvsp : 11.3 12.1 0.40 
~15 aDPCM 12.8 13.0 105 050 -15 aprcm 14. 14.8 12.5 0.64 
O ADPCM 12.0 13.8 13.4 0.45 QO ADPCM 1 14.9 145 0.38 
+15 ADPCM 6.1 11.3 


—-15 ADM 14. 12.5 10.5 0.66 
1 


3 
5 
3 
5 11.2 12.6 0.67 
10.5 8.2 0.73 —-15 ADM 16.5 
17.8 
9 


4 
4 
4. 
13.0 0.47 +15 ADPCM 2 
7 
8 


4.1 
0 ADM 16 12.1 11.5 0.38 0 ADM 16.4 14.7 0.58 
+15 ADM 8.8 10.7 11.6 035 +15 apm : 13.1 12.6 0.51 
LPC 0.35 LPC 0.47 


Table V—Overall LPC distances for tandem links 


First Second 
Link Link Talker: M1 Talker: Fl Talker: M2 Talker: F2 
LPC SBC 1.18 0.93 0.83 0.81 
LPC SBC 0.20 0.73 0.672 0.66 
LPC SBC 0.61 0.57 0.46 0.56 
LPC CVSD 0.91 0.92 0.88 1.10 
LPC CVSD 0.66 1.00 0.54 0.91 
LPC CVSD 0.60 0.66 0.64 0.83 
LPC ADPCM 0.79 0.90 0.69 1.09 
LPC ADPCM 0.71 0.75 0.53 0.68 
LPC ADPCM 0.61 0.72 0.58 0.76 
LPC ADM 0.90 1.40 0.82 0.91 
LPC ADM 0.68 0.80 0.68 0.77 
LPC ADM 0.59 0.64 0.60 0.90 
SBC LPC 1.50 1.08 0.86 0.93 
SBC LPC 1.05 0.88 0.78 0.78 
SBC LPC 0.71 0.57 0.61 0.62 
CVSD LPC 0.91 0.86 0.98 1.21 
CVSD LPC 0.62 0.75 0.63 0.84 
CVSD LPC 0.62 0.82 0.56 0.89 
ADPCM LPC 0.75 0.76 0.71 0.79 
ADPCM LPC 0.63 0.80 0.67 0.87 
ADPCM LPC 0.51 0.74 0.60 0.76 
ADM LPC 0.79 0.71 0.86 1.06 
ADM LPC 0.58 0.81 0.64 0.80 
ADM LPC 0.47 © 0.62 0.56 0.69 
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distance D for each of the four talkers used in the experiment. Due to 
the large variability of the objective measures across talkers and 
sentences (a different sentence was used for each condition), it is 
difficult to make meaningful comparisons across conditions. A similar 
variability was observed for the objective measurements across indi- 
vidual coders in the tandem links. 

Table V gives results for Lpc distance for the overall tandem links. 
Again, a large variability 1s seen across conditions due to the different 
sentences used for each measurement. 


5.3 Relationship of subjective and objective measures 
5.3.1 Correlations 


Previous studies have demonstrated the inadequacy of SNR as an 
indicator of subjective quality and have pointed to segmental signal- 
to-noise ratio and to LPC distance metrics as more promising measures. 
In the present experiment, the diversity of speech material and of 
signal-processing approaches exceed those of previous studies, and 
thus the merits of single measures and combinations of measures as 
subjective quality indicators are tested more critically than ever before. 

Table VI shows correlations of average rating with each of the 
objective measures. The subscripts A, B, and AB, appended to SNR, 
SEG1, SEG2, and D, refer to measures taken on the first link of a tandem 
circuit (or the entire single-link circuit), the second link of a tandem 
circuit, and the overall circuit, respectively. 

Table VI indicates that the diversity of conditions either eliminates 
or dilutes the value of each of the measures as a predictor of speech 
quality. The table gives correlations of average rating (over 22 subjects) 
with each one of the objective measures. There are nine objective 
measures; 3 s/n’s and one LPc distance for each half of a tandem 
connection, and the overall Lec distance. Except for Da, the LPC 
distance of the first link, and Daz, the overall LPc distance, none of the 
measures is applicable to all conditions. (For example, s/n is measured 
only in the first link in the single-link and waveform-to-LPCc circuits. It 
is measured only in the second link in the LPc-to-waveform circuits.) 
In addition to the correlation, the table shows the number of data 
points used in the computation and the significance (two-tailed) of the 
null hypothesis that the coefficient is zero. 

It should be noted that, for all talkers, the only statistic for which 
the null hypothesis can be rejected at the 0.01 level is Daz, the overall 
distance. The two-tailed significance level for SEG2, is 0.001, but the 
correlation is negative. Surely a one-tailed test applies here, and the 
null hypothesis cannot be rejected. Computing correlations for ratings 
of male and female talkers separately, we see the same situation, 
except that SEG1z is significant at the 0.01 level as a predictor of male 
speech quality on the LPc-to-waveform tandems. 
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Table Vi—Correlations of average ratings with objective measures 


All Talkers 


No. of 
Conditions 


Signif. 
0.1 


Corr. 


—0.038 
—0.043 
—0.196 
0.233 
0.254 
0.510 
0.318 
—0.056 
—0.415 


Male Talkers 


No. of 
Conditions 


Signif. 


0.4 
0.4 
0.09 
0.02 
0.1 
0.005 
0.06 
0.4 
0.001 


Corr. 


0.014 
0.046 
—0.167 
—0.193 
0.382 
0.381 
0.031 
—0.086 
—0.709 


Female Talkers 


No. of 
Conditions 


48 
48 
48 
74 
24 
24 
24 
48 
74 


Signif. 
0.5 
0.4 
0.1 
0.05 
0.03 
0.03 
0.4 


0.001 


The poor correlations of practically all s/n measures with subjective 
quality has led us to abandon all of them as performance indicators of 
the tandem circuits and to focus our attention on the LPc distance 
measures. | 


5.3.2 Prediction of subjective quality 


Working with the Lpc distance measure for the first link of a tandem 
connection, Da, the distance measure for the second link, Dz, and Daz, 
the overall distance measure, linear regression procedures, were ap- 
plied to find formulas for predicting the average ratings, R , of the 148 
circuit conditions. The best linear combination of the three distances 
was 


R = —5.48D,4 — 6.47Dp + 2.52Dap + 7.38. (24) 


The standard deviation of the 148 mean ratings was 1.55 units on the 
9-point scale and the standard error of this regression was 1.10. The 
proportion of variance accounted for is thus 51 percent, and the 
multiple correlation coefficient is 0.712. 

The prediction accuracy can be improved somewhat by accounting 
for the fact that ratings and LPc distances are related differently for 
male and female talkers. We have done so by introducing a new 
variable, M, where M = 1 for male talkers and M = 0 for female 
talkers. Introducing M to the regression, we have 


R = —4.99Da — 5.98Dp + 2.14Dap + 0.48M + 6.85. (25) 


Here the standard error is 1.08, i.e., 53 percent of the variance is 
accounted for and the multiple correlation coefficient is 0.727. 

Various transformations of the distance data were also studied and 
a simple log transform proved useful in regression equations. We define 
the transform variables 


La = In(Da); Lap = In(Dap) 
and 
Lg = In(Dz) in tandem circuits and 
= —4.0 in single-link circuits. 


The value —4.0 has been chosen empirically. (It corresponds to a 
distance of 0.018. The lowest measured distance was 0.21, which was 
observed for several sentences processed by LPC.) 

Using the log-transformed distances, the regression equations cor- 
responding to (24) and (25) are 


R = -1.55L,4 — 0.785Lp — 0.211 Lap + 1.59 (26) 
and 
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R = -1.34L,4 — 0.782Lp — 0.0622L an + 0.643M + 1.51. (27) 


The standard error of (26) is 1.02, which accounts for 58 percent of the 
variance in average ratings and the multiple correlation coefficient is 
0.760. The corresponding statistics for (27) are 0.973, 62 percent, and 
0.785. 


VI. DISCUSSION 


These data analyses allow us to make generalized statements in 
answer to the three questions posed in Section 1.2. Owing to the 
interactions in the data, there are specific exceptions to many of the 
general conclusions of the following subsections. 


6.1 Quality of tandem connections 


A strong conclusion of the study is that any tandem connection of 
the vocoder is substantially worse than either of the two corresponding 
single links. Although we did not attach descriptive adjectives to rating 
categories, we have the impression that ratings below about 4.0 re- 
flected degradations severe enough to render a circuit inadequate for 
effective communication. 

In our judgment, the results of this experiment strongly suggest that 
a tandem connection involving any of the three differential waveform 
coders (CVSD, ADPCM, or ADM) 1s inadequate. It appears that the LPc- 
SBC tandem could provide reasonable communication in many circum- 
stances, but that the sBc-LPc tandem is of marginal use. 


6.2 Alternatives to CVSD 


Only the sub-band coder, which is substantially more complicated, 
offers significantly better performance than cvsp over all circuit con- 
ditions, talkers, and input levels. ADM, a double integration version of 
CvspD, has the same subjective quality (within the bounds of experi- 
mental error) and ADPCM is better than CvsD in one tandem direction, 
equal to cvsD in the other tandem direction and worse than CvsD in 
the single link configuration. The ADPCM coder was designed by 
extrapolating, to 16 kb/s, results of an experiment involving 24 kb/s 
and 32 kb/s coders. The result of this design optimization was a coder 
that adapts somewhat more slowly than ADPCM coders used elsewhere. 
It may be that higher quality could be obtained with a faster adaptive 
quantizer in the ADPCM coder. 


6.3 Objective measures 


The wide variety of circuit conditions and speech material either 
destroyed or strongly diluted the value of the objective measures as 
indicators of speech quality. With the wide range of input levels, the 
outputs of differential waveform coders contained various types of 
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additive noise and signal distortion. Meanwhile, the sub-band coder 
and Lpc each have their own peculiar distortions; a reverberant effect 
and a mechanical buzziness, respectively. The presence of all these 
impairments in the single link circuits and their combinations in the 
tandem circuits together present a diversity of quality that would be 
very hard to describe with a single measure. 

While the wide range of circuit conditions produces great subjective 
variability, the variety of speech material seems to have a strong effect 
on the objective measures. We speculate that sentence-to-sentence 
fluctuation in objective measures is greater than that of corresponding 
subjective impressions. 

These irregularities led to regression formulas of considerably less 
accuracy [about 60 percent of variance accounted for by eqs. (28) and 
(29)] than the 70 to 90 percent obtained in other studies.*’* Our work 
lends support to the value of current efforts to find more robust 
objective measures.'”"” 
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Motion-Compensated Television Coding: Part | 
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(Manuscript received August 29, 1978) 


We present methods of estimating displacements of moving objects 
from one frame to the next in a television scene and using such 
displacements for frame-to-frame prediction. Displacement is esti- 
mated by a recursive algorithm which seeks to minimize a functional 
of the prediction error. Several simplifications of the algorithm are 
presented which make tt attractive for hardware implementation. 
Performance of the algorithm is evaluated by computer simulations 
on two sequences of moving images containing various amounts and 
types of motion. In both cases, the use of displacement-based (or 
motion-compensated) prediction results in bit rates that are 22 to 50 
percent lower than those obtained by simple “frame-difference”’ pre- 
diction, which is used commonly in the interframe coders. 


|. INTRODUCTION 


Television signals are generated by scanning a scene several times 
a second even though there may not be any change in the scene from 
one frame to the next. This results in a considerable frame-to-frame 
redundancy in the signal. Existence of this redundancy has long been 
recognized, and several measurements have been made to quantify it. 
However, the first real demonstration of a frame-to-frame coder which 
used redundancy between successive frames was made in 1969 by 
Mounts.’ Since then, several improvements have been made to the 
basic frame-to-frame encoder resulting in prototypes or real imple- 
mentations of coder-decoder pairs.”” These are the subjects of two 
excellent surveys,”’ the first one covering material up to 1972 and the 
second one up to 1978. As is evident from these works, most frame-to- 
frame coders are based on the following: 

(t) Segmenting each television frame into two parts, one part which 
is the same as the previous frame and the other part (called the moving 
area) which has changed from the previous frame. 

(tz) Transmitting two types of moving area information: (a) ad- 
dresses specifying the location of the picture elements in the moving 
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area, and (b) information by which the intensities of the moving area 
elements can be updated. 

(zit) Matching the coder bit rate to the channel rate. Since the 
motion in a real television scene occurs randomly and in bursts, the 
amount of information about the moving area will change as a function 
of time. To transmit it over a constant bit rate channel; (a) smooth 
out the transmitted information rate by storing it in a buffer prior to 
transmission, and (6) use the buffer fullness to regulate the encoded 
bit rate by varying amplitude, spatial, and temporal resolution of the 
television signal. 

Intensities of the moving area picture elements are transmitted by 
predictive coding which sends frame difference, element difference, or 
line difference (or their combination) as the differential signal. At- 
tempts have been made to optimize the picture quality within the 
constraints of the buffer size and the channel rate. 

Simultaneously with these implementations, computer simulations 
have been used to explore other improvements of the frame-to-frame 
coders. It has long been recognized®® that, if an estimate of the 
translation of an object is obtained, more efficient predictive encoding 
can be performed by taking differences of elements in the moving 
areas with respect to elements in the previous frame that are appro- 
priately translated. We refer to such schemes as “Motion-Compensated 
Coding Schemes.” Their success depends obviously on the following: 
(t) the amount of purely translational motion of objects in a real 
television scene,* (ii) the ability of an algorithm to estimate the 
translation with an accuracy that is desirable for good prediction of 
intensity, and (zi) robustness of the displacement estimation algo- 
rithm when amplitude, spatial, and temporal resolution of the trans- 
mitted picture are lowered due to buffer fullness. 

Several methods of estimating the displacement of an object in a 
television scene have been proposed. Methods of point-by-point cor- 
relation’ or pattern matching used in scene analysis’ appear to be 
too complex for present-day implementation, especially if displacement 
needs to be defined with resolution finer than the sampled grid of a 
television frame. Simpler displacement estimation techniques’* uti- 
lize the relation between the spatial differential and temporal differ- 
ential signals. They would be easier to implement. Another approach 
is adaptive linear prediction using elements in the previous frame (or 
field) which are displaced in all directions (horizontal left, right; vertical 
up, down) by a certain maximum amount and adapting the coefficients 
to minimize an intensity error function.’ All these techniques assume 


* So that the television camera sees the objects in pure translation, it is also necessary 
that the lighting be uniform in the camera field of view. 
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that the displacement is constant within a block of picture elements. 
This assumption presents difficulties in scenes with multiple moving 
objects, occluding objects, as well as different parts of the same object 
moving with different displacements. Of course, decreasing the size of 
the block makes this assumption more realistic, but then the quality 
of displacement estimate suffers. 

The techniques proposed by Haskell’® do not require an explicit 
estimate of translational displacement and are applicable even if the 
motion is not precisely translational. However, they also work on 
blocks of picture elements and require a matrix inversion in addition 
to other complicated operations and, therefore, do not appear to be 
easily implementable without a significant simplification. 

We present several new techniques for estimating motion. They 
attempt to minimize recursively a measure of the motion-compensated 
prediction error. Thus, given an ith estimate of displacement, we 
obtain the (z + 1)th estimate such that, in general, the motion- 
compensated prediction error resulting from (z + 1)th estimate is lower 
than that using the 7th estimate. The recursive minimization is per- 
formed by a gradient or steepest descent algorithm. Considerable 
freedom exists in the specific choice of this algorithm. Our choices 
have been guided primarily by a desire to implement this algorithm in 
real time. 

In Section II, we present a derivation of several motion-estimation 
algorithms and describe some simulations to evaluate their perform- 
ance. Section III contains modifications of one of the algorithms of 
Section II for use in a frame-to-frame coder. Here we evaluate the 
performance of the algorithm in the context of a frame-to-frame coder. 
Section IV contains a discussion and the conclusions of our study. 
Many enhancements of the algorithm are possible, and some of these 
will be presented in a companion paper.'® 


li. MOTION ESTIMATION 


In this section, we derive some simple algorithms for estimating 
motion. They attempt to minimize recursively a certain quantity 
(function of the motion estimation error). If the changes in successive 
television frames are due to translation of an object, then the algorithm 
iterates in a gradient or steepest descent direction such that the 
consecutive estimates converge to an estimate of translation. A proof 
of convergence of such a scheme under certain assumptions Is given In 
the appendix and is supported by a large number of computer simu- 
lations presented at the end of this section. 


2.1 Motion estimation in a block of pels 


We mentioned in the introduction that most algorithms in the 
literature for estimating translation of an object from a television scene 
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assume that the translation is constant within a block of picture 
elements (pels). We start with one such algorithm developed by Limb 
and Murphy” and Cafforio and Rocca’ and show how it can be 
modified to obtain a better estimate of translation. This is done 
primarily to define a quantity which is fundamental to our recursive 
algorithm introduced in the next section. 

Assuming a 2:1 interlaced raster format, let J(x, ¢ — 7) and I(x, t) be 
the intensity values of the two successive frames as a function of 
spatial location x (a two-dimensional vector) and time ¢. The time 
between the two frames is fr. If an object moves in translation, then in 
the moving area (disregarding the uncovered background): 


I(x, ¢) =I(x —D,t- 7), (1) 


where D is the translation vector of the object during the time interval 
[¢t — 7, t]. The frame difference at spatial position x is given by 


FDIF (x) = I(x, t) — I(x, t—1) 
= I(x, t) — I(x + D, ¢), (2) 
which can be written, for small D, by Taylor’s expansion about x as 
FDIF (x) = — D’VI(x, t) + Higher Order Terms in D, (3) 


where V is the gradient with respect to x and superscript T on a vector 
or matrix denotes its transpose. If the translation of the object is 
constant over the entire moving area (except for the uncovered back- 
ground) and if the higher order terms in D can be neglected, then both 
sides of the above equation can be summed over the entire moving 

area to obtain a good estimate for translation. We recognize that VI 
can be taken to be a vector of element and line differences (EDIF, 
LDIF) if the intensity is available on a discrete grid as is the case in 
most coding situations. Using linear regression, we get D, an estimate 
of D as: 


| ~1 
p--| » VI tx, £)-VTtx, "| p PDIP (x)-V 1x, 1), 
moving moving 


which can be written as 


; i EDIF?(x) y an s 
D on a . 


Y EDIF (x)-LDIF(x) Y¥ LDIF?(x) 
P FDIF meee 


>| FDIF (x)- LDIF (x) 
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where all the summations are over the entire moving area. This can be 
approximated* by assuming that 


| EDIF (x)- LDIF (x) = 0 
moving 


and then 


(4) 


* FDIF (x)- EDIF (x)/¥. aie 
D mone ° 


\ FDIF (x)- LDIF (x)/¥, LDIF? (x) 
This can be further approximated* by avoiding the multiplications in 
the sums as: 
). FDIF (x)sign( EDIF (x)) 


y | EDIF (x) | 
(5) 


=> 
I 
| 


). FDIF (x)sign( LDIF (x)) 
> | LDIF (x) | 
where 


0, if z=0 
sign(z) = (6) 
——-,» otherwise. 
|z| 
This algorithm is identical to one of the algorithms given by Limb and 
Murphy.” Its accuracy may be improvedy by several modifications 
suggested by Limb and Murphy” and Cafforio and Rocca.” 

We suggest another modification which improves the above motion 
estimator further. First, we note that the above estimates are good as 
long as D is small. As D increases, the quality of the approximation 
becomes poor. This can be overcome to some extent by linearizing the 
intensity function around an initial estimate of D. This is possible in 
a television situation, where there is an estimate of D for every field. 
Thus, for the ith field, displacement estimate D‘ can be obtained by 
linearizing the intensity function around the displacement estimate for. 
the (i — 1)th field. This process results in the following recursion: 


Di =D'+U;, (7) 


* We make no attempts to justify these two approximations. They are made merely 
to derive the algorithm given in Ref. 13. Perhaps, instead of the least squares, some 
other criterion may lead to the same result (i.e., eq. (5)) with fewer assumptions. 

+ The improvement suggested by Limb (Ref. 13) was simulated for comparison 
purposes. However, for the type of pictures used in Section 3.3, the performance did not 
change noticeably by incorporating the improvement. 
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where D*”’ is an initial estimate of D‘ and U' is the update of D‘™ to 
make it more accurate, i.e., an estimate of D — D*”!. 

We now define the quantity pFp (x, D‘"), called the displaced frame 
difference, which is analogous to FDIF (x) used in (4) and (5), 


DFD(x, D*') = I(x, t) -I(« —D*'!, t—-—7) (8) 


DFD is defined in terms of two quantities: (1) the spatial location x at 
which it is evaluated and (ii) the displacement D‘! with which it is 
evaluated. Obviously, in the case of a two-dimensional grid of discrete 
samples, an interpolation process would be used to evaluate I(x — 
D‘', ¢ — r) for nonintegral values of D‘'. As defined, pFp has the 
property of converging to zero as D' converges to the actual displace- 
ment, D, of the image. Following the same steps as were used in the 
derivation of eq. (5), we get: 


DFD(x, D’!) = I(x, t) — I« +D —D*", 2) 
= —(D — D‘")’.VI (x, t) + Higher Order Terms. (9) 
Neglecting higher order terms and making approximations similar to 


the above results in an estimate of D — D*! which, when combined 
with eq. (7), yields: 


¥. DFD(x, D’')sign( EDIF (x)) 
| > | EDIF (x) | 
D‘'=D*" - | ; (10) 
¥. DFD(x, D‘')sign( LDIF (x)) 
>» | LDIF (x) | 
where the summations are again carried over the entire moving area. 
This may be simplified slightly by noting that if the initial estimate 
of displacement D‘” has only integral components, then DFD(-,-) can 
be computed without interpolation. Let [D] denote an integer approx- 


imation to D. This can be obtained either by truncating or rounding 
both the components of the veetor D. 


¥. DFD(x, [D‘"'])-sign(EDIF (x)) 
| \\ | EDIF (x) | 
D‘ = [D*"] - 7 . (11) 
y, DFD(x, [D*"])-sign( LDIF (x)) 
\\ | LDIF (x) | 
We describe the performance of both the above algorithms later in 
this section. 
2.2 Pel-recursive estimation of motion 
We mentioned in the introduction that there is an advantage in 
recursive algorithms which iterate on a pel-by-pel (or on a small block 
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of pels) basis, i.e., they revise their displacement estimate at every 
moving area pel. Such recursive algorithms overcome, to a large extent, 
the problems of multiple moving objects, as well as different parts of 
an object undergoing different displacements, provided the recursion 
has sufficiently rapid convergence. Since we intend to use the displace- 
ment estimator for predictive coding, our algorithm should in some 
manner attempt to minimize the resulting prediction error. Also, since 
the prediction error is calculated for transmission anyway, its use in 
the recursive estimation of displacement does not result in extra 
computations and is therefore advantageous. Thus, if a pel at location 
x, is predicted with displacement D‘! and intensity I(x, — D'!, t — 
7), resulting in prediction error DFD(x,., D‘'), the estimator should try 
to produce a new estimate, D‘, such that|DFD(x., D‘)|<]| pFD(Xa, 
D‘')|. To this end, we attempt to recursively minimize [DFD(x, D)]’ at 
each moving area element using a gradient type of approach. For 
example, 


D‘ = D*? — (€/2)Vp[DFD (ka, D“")’ 
= D*! — eprp(x,, D'')VppFp {k,, D“’), 


where Vp is the gradient with respect to displacement D and € Is a 
positive scalar constant. The gradient Vp may be evaluated using the 
definition of DFp and noting that 


Vp(DFD(x,, D’')) = + VI(x. — D", t—7), (12) 
where V is the gradient with respect to x. This gives us 
D' =D"? - erpv(x,, D')Vi«, -—D*|t-7), (13) 


where DFD and VJ are evaluated by interpolation for nonintegral D‘™. 
A significant reduction in computation of VJ is achieved by quantizing 
D‘*' to an integral value. Thus, if [D‘'] represents a rounded or 
truncated value of each of the components of D‘”', then the estimator 
of eq. (13) can be simplified to 


D' = D*! — eprp(xz, D*)VI(x. — [D"], t —7). (14) 


It should be pointed out that Vp could have been evaluated using (9), 
resulting in an estimator in which VI is evaluated at (Xa, ¢) instead of 
(x, — D‘', t— 1) as above. This second method implies an assumption 
regarding the expansion of I(-,-), which may not be valid if D — D*! 
is large. Also, there is no difference in the computational complexity if 
it is assumed that a linear interpolation of I(x, t — 7) is used to 
compute DFD, and the resulting displaced line and element differences 
are used to define VI of eq. (14). 

It is interesting to observe that at every iteration we add to our old 
estimate a vector quantity parallel to the direction of the spatial 
gradient of image intensity and whose magnitude is proportional to 
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the motion-compensated prediction error. It may be seen from eq. (9) 
that if the displacement error (D — D‘”’) is orthogonal to the intensity 
gradient VJ, the displaced frame difference DFD (-,-) is zero, giving a 
zero update for recursion of eq. (14). This may happen even though 
the object may have actually moved. However, this is not a failure of 
the algorithm, but rather is identical to the situation in which an 
intensity ramp is translated and only motion parallel to the ramp 
direction (VJ) is perceived. Motion perpendicular to the ramp direction 
is unobservable, and as such is arbitrary. It is only through the 
occurrence of edges with differing orientations in real television scenes 
that convergence of D‘ to actual D is possible. 

The quantities involved in the above algorithm are shown in Fig. 1. 
An initial estimate of the displacement at pel x,, D‘', is to be updated 
using (14), yielding D‘. Using the initial estimate D’! and x., the 
samples in the previous frame in the neighborhood of spatial position 
x, — D'” are located (for example: samples b, c, d, e, and f). The 
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Fig. 1—Recursive motion estimation. Displacement estimate D'"' is updated at pel a. 
Gradient of intensity, V(x, — [D'™'], ¢ — 1), is obtained by using intensities at pels b, c, 
d, e, f in the field (7 — 2). 
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samples of this neighborhood are then used to compute the update 
term in conjunction with I(x,, t). Thus, in Fig. 1, the components of 
intensity gradient may be approximated by 


EDIF = (I, — I.)/2 (15) 
LDIF = (Ip — I;)/2. (16) 


Similarly, I(x. — D'’, t — +r) may be approximated by a two- 
dimensional linear interpolation using the intensities of the neighbor- 
hood. In the next section, we present several simplifications of this 
basic algorithm and adapt it for frame-to-frame coding. Recursive 
algorithms, using a model of the video process for motion estimation 
in a different context, are described in Ref. 17. 


2.3 Motion estimator performance 


In this section, we give simulation results for the first two motion 
estimators, algorithm I (eq. (5)) and algorithm II (eq. (11)). Obviously, 
the performance depends upon the type of scene and, even then, it is 
not clear how to measure the performance. We have used two types of 
scenes. The first is produced synthetically using the following formula: 


127 if ||Ri| > 100 
I(x, t) = (17) 


127 (1 + e Fl cos(0.2-7- ||R||), otherwise, 


where R = x — (x, + Dé), 127 is the background intensity (on a scale 
of 0 to 255), x. is the location of the center of the moving pattern at 
t = 0, D is the displacement of the pattern per frame (i.e., velocity), 
and |-| denotes the Euclidean norm. This formula produces a series of 
alternating light and dark concentric rings with exponentially decreas- 
ing radial intensity variation. This pattern was chosen because it 
contains a distribution of edges with different direction and height. 
I(x, t) was sampled both in time ¢ and space x to produce a set of four 
frame sequences with strictly horizontal translation ranging from 0.5 
to 6.0 pels per frame. 

The second type of scene is also a collection of four frame sequences 
containing an object approximately in horizontal translation. These 
are the same as shown in Fig. 1b of Ref. 15. They contain a mannequin’s 
head which was moved at various nominal speeds from 0.4 to 4.7 pels 
per frame. 

The first type of scene was chosen so that an exact velocity was 
known and, therefore, the performance of motion estimators could be 
evaluated rather easily by measuring the deviation from this known 
velocity. A second measure of performance was obtained by computing 
the “match entropy” of the elements in the moving area of each field. 
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That is, using the displacement estimate D', obtained from two con- 
secutive fields, the entropy of DFD(x, D') was computed using a linear 
two-dimensional interpolation over the moving area elements of frame 
I(x, t). This quantity is similar to the entropy of the prediction error; 
however, some future information is used in its calculation. Due to the 
presence of shadows and nonuniform illumination, the second type of 
scene does not possess a precisely defined velocity, and therefore only 
match entropy was used to evaluate the motion estimators for this 
type of scene. 

In all the simulations of the results of algorithms I and II of this 
section and the next section, the moving area elements were deter- 
mined by an algorithm similar to that described in Ref. 13. Likewise, 
the definition of EDIF (x) and LDIF (x) used in the simulations of 
algorithm I may be found in Ref. 13. For E DIF (x), this involved the 
averaging of the element differences at J (x, t) and I(x, t — r). LDIF (x) 
was computed in a similar manner. To extend this definition for use in 
algorithm II, the corresponding differences at I(x, t) and I(x —[D‘'], 
¢ — Tr) were averaged. 

The performance of algorithms I and II is given in Figs. 2a and 2b 
for the synthetic scenes. As seen in Fig. 2a, the error in the estimated 
velocity* using algorithm II is considerably smaller than that of 
algorithm I, especially at higher velocities. The peak observed in the 
estimates of algorithm I is perhaps due to the insensitivity of the 
algorithms to a per-frame shift equal to the period (10.0 units) of the 
synthetic moving image. The initial estimate of displacement for 
algorithm II was assumed to be 0. It is interesting to note that the 
curve of algorithm I is approximately the same as that of algorithm II 
after the first iteration; but after only three iterations, algorithm II 
converges to its curve in Fig. 2a. In Fig. 2b, the match entropies 
resulting from the estimates of Fig. 2a are shown. Again, the superiority 
of algorithm II over algorithm I is clearly seen. Also for comparison, 
we have included the entropies of F'DIF (x) of picture elements in the 
moving area. As expected, both algorithms I and II show significant 
improvements compared to frame differences. These conclusions for 
synthetic pictures remained generally unchanged when the direction 
of motion was changed and when random frame-to-frame noise of —40 
dB (s/n ratio) was added to the scenes. 

Performance of algorithms I and II for the second type of scene is 
given in Fig. 3. Here, since we do not have available an exact velocity, 
we only give the match entropy at the various nominal horizontal 
velocities used to create the scenes. While both algorithms result in 
match entropies considerably smaller than the frame difference en- 


* Average of last four out of six estimates. 
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Fig. 2—Performance of two motion estimators which obtain one displacement esti- 
mate per field on synthetic pictures. Algorithm II results in considerable improvement 
over algorithm I. Zero entropy indicates that, in addition to a perfect estimate, there 
was no interpolation error in evaluating the prediction error. Estimated vertical velocity 
components (in lines/frame) were + 0.3 for algorithm I and + 0.02 for algorithm II. 


tropy, we see that the performance of the two appears about the same. 
This result is perhaps due to the averaging of the displacement 
components over a large block (i.e., the moving area of an entire field). 
It should be mentioned that the performance of both the algorithms 
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Fig. 3—Performance of two motion estimators which obtain one displacement esti- 
mate per field on moving mannequin. 


can be improved by separating the uncovered background, as suggested 
by Cafforio and Rocca.“ 


lll. CODER PERFORMANCE USING RECURSIVE DISPLACEMENT 
ESTIMATION 


In the previous section, we developed motion estimators and dis- 
cussed the performance of estimators which obtain one velocity esti- 
mate per field. We used scenes which had only one object, moving with 
a nearly uniform translational displacement. However, such scenes are 
unrealistic and, therefore, estimators which can dynamically adjust to 
motion of objects are more desirable. In this section, we first describe 
our recursive estimator in more detail and then evaluate its perform- 
ance on scenes that are much more realistic. We then modify our basic 
estimator so that it can be incorporated in a frame-to-frame encoder 
and evaluate the coder performance. We note that we have paid little 
attention to a very important facet of frame-to-frame coders: resolution 
control using the contents of the buffer. It is important that motion 
estimation does not suffer immensely in lower resolution modes of the 
coder. Although some of these issues will be considered in part II,’® 
more realistic performance evaluation can only be done using a hard- 
ware coder working on real scenes. 

The first scene, called Judy, consists of 64 frames (2:1 interlaced 
fields) of 256 < 256 samples each, obtained at 30 times a second and 
sampled at Nyquist rate from a video signal of 1 MHz bandwidth, and 
contains head-and-shoulders view of a person engaged in a rather 
active conversation. The portion of a frame classified as moving area 
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varies from 15 to 51 percent. Also, the motion is not translational, and 
different parts of the scene move differently (such as lips, eyes, and 
head). Four frames of this scene are shown in Fig. 4. 

The second scene, called Mike and Nadine, consists of 64 frames 
with the same resolution as the scene Judy and contains a panned full 
body view of two people briskly walking around each other on a set 
with severe nonuniform illumination. The percentage of a frame clas- 
sified as moving area varied from 92 to 96 percent. Four frames from 
this sequence are shown in Fig. 5. 


3.1 Basic estimator performance 


The basic recursive estimator consists of the following: the displace- 
ment estimate is updated at each moving area picture element using 
eq. (14), ie., 


~ 


D'=D‘''- soy PFD, D'").VI(x; -([D*"],t-7), (18) 
where D’ is the estimated displacement of moving area pel x;, D’~’ is 
the last estimate formed prior to pel x; during the pel-by-pel, line-by- 
line (interlaced raster scan order) iteration through the moving area, 
VI is the spatial gradient of the intensity, approximated by E DIF and 
LDIF defined in (15) and (16), and [D’"] denotes rounded D‘’. A two- 
dimensional linear interpolation is used to evaluate DFD (interpolation 
is discussed in detail in Section 3.3.3). Instead of using the previous 
frame for the intensity I(x, t — r), we have used the previous field. 
Relative advantages of this choice are discussed later. Also, both the 
horizontal and vertical components in the displacement error estimate 
(i.e., the second term of the right-hand side of eq. (18)) are clipped at 
a magnitude of (2s), so that the displacement from pel to pel 1s not 
allowed to change by more than ‘ce pel/field and “ce line/field. This 
avoids the possibility of rapid oscillations in displacement due to noise. 
The accuracy used in computation of displacement and interpolation 
is “2s pel (or line) per field. 

In all the simulations of this section, with the exception of the results 
of algorithm I appearing in Fig. 7, the moving area picture elements 
were determined by the following rule: pel z (Fig. 6) is classified as a 
moving area pel if either 


(i)| FDIF(z)|> Te 


or 
(i)| FDIF(z)| > T; 
and 
| FDIF | at a, b,c, ord > T; 
and 


| FDIF|at A, B,C, orD>T,, 
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Fig. 4—Four frames of the scene Judy. 
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Fig. 5—Four frames of the scene Mike and Nadine. 


pay ele Nn ee Se PREVIOUS 
7 LINES 


a 


INTERLACED/. ~ 
LINES \\ 
\ N~\ 


PRESENT 
LINE 





aa a b Z Cc 





St pac ee ae ea at FUTURE 
LINES 





Fig. 6-—Configuration of pels used in the moving area segmentor. 


where T; and 7, are two thresholds, and T2 = T; in most cases. This 
segmenter is quite similar to the one in Refs. 4 and 15. It overcomes, 
to a large extent, the effects of frame-to-frame noise which otherwise 
produce a large number of isolated moving area elements. For the 
basic estimator, the moving area was segmented with thresholds 7; 
= 4 and T> = 255 (on a scale of 0 to 255). 

The performance of this basic estimator is given in Fig. 7 for the 
sequence Judy. Also for comparison, we have included the entropies of 
the moving area frame differences and the match entropy of algorithm 
I of Section II. On the average, match entropy for the recursive 
estimator was lower by about 1.4 bits/moving area pel than that of the 
frame differences, while algorithm I only resulted in approximately 
half of this decrease.* 


3.2 Basic coder performance 


In order to incorporate the estimator as a part of a coder, several 
other choices have to be made. In this section, we describe a basic 
coder and its performance. The next section contains modifications 
and simplifications of the basic coder and their effects on the perform- 
ance. The basic coder consists of the following: 


3.2.1 Displacement estimator 


In the predictive coder, since the displacement estimate is not 
transmitted to the receiver, it has to be derived from the previously 
encoded and transmitted data. We derive a displacement estimate for 
a previously transmitted pel and use it for computation of the predic- 


* Results in Fig. 7 for algorithm I were obtained by using a slightly different moving 
area segmenter given in Ref. 13. 
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Fig. 7-—-Performance of basic recursive estimator on the scene Judy. Compared to frame difference and the estimator algorithm I, which obtains 
one displacement estimate per field, the recursive estimator results in a lower and relatively constant entropy. 


tion of the present pel. The two natural choices for the relative location 
of the displacement estimate are the pel above and the pel to the left 
of the predicted pel. We have chosen the previous line (i.e., the pel 
above in the same field), since it relieves hardware timing constraints 
that would have otherwise resulted. If the previous line pel is not a 
moving area pel, we use the initial displacement estimate that would 
have been used by the above pel if it were moving. Such use of 
displacement estimates for prediction is shown in Fig. 8. We see that 
several elements (e.g., elements 7 + 1,7 + 2, ---,7 + 4 of the present 
line) may be predicted with the same displacement estimate if the 
corresponding elements of the previous line are not moving area 
elements. Other details of the estimator remain the same as those 
given in Section 3.1 for the basic estimator. 


3.2.2 Segmentation 


Every field was first divided into two segments: moving area and 
background, using the segmentation strategy described in Section 3.1. 
For a good picture quality, the thresholds T; and T2 were chosen to be 
1 and 3 (on a scale of 0 to 255), respectively. Moving area was further 
divided into two segments: (z) compensable regions, where motion- 
compensated prediction is adequate and, therefore, no update infor- 
mation need be sent; and (zz) uncompensable regions which require an 
update (i.e., transmission of prediction error). This segmentation of 
the moving area was performed using the following rule: 

(t) If the magnitude of the motion-compensated prediction error 
for a moving area pel is greater than 3 (out of 255), then it is 
called uncompensable. 

(it) To reduce the occurrence of isolated compensable pels, com- 
pensable pel runs of length 1, 2, and 3 between uncompensable 
pels were bridged by calling them uncompensable. 

This segmentation is shown in Fig. 9. 


Di pitt pit2 pit3 pit4 


/ Zf 4 / / PREVIOUS LINE 


J j+1 [Fr 2 j+3 j+4 PRESENT LINE 
O- BACKGROUND ELEMENT C] — MOVING ELEMENT 


Fig. 8—Use of displacement estimates (D‘) in prediction process. Displacement 
estimates are formed at every moving area pel in the same order as the scanning process. 
Displacement estimate of the previous line element is used for prediction of the present 
line picture elements. 
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Fig. 9—Motion-compensated coder segmentation. Elements whose motion-compen- 
sated prediction error is lower than a threshold are called compensable and are not 
transmitted. 


DECISION LEVELS 1,7,13,21,30,39,49,60,71,82,95,108,121,134,147,160,173,255 


REPRESENTATIVE LEVELS 0,4,10,17,26,35,44,55,66,77,87,102,115,128,141,154,167,179 


Fig. 10—Quantizer for basic coder. Only the positive side of the symmetric quantizer 
is shown. 


3.2.3 Quantization 


The prediction error was quantized using a 35-level symmetric 
quantizer shown in Fig. 10. Other quantizers were also investigated. 
The relative performance of the coder did not depend heavily on the 
quantizer. However, the picture quality degraded as would be expected 
by using a very coarse quantizer. The quantizer of Fig. 10 was chosen 
since it gave good picture quality, although the quantization error was 
clearly visible to a trained eye. 

The performance of this basic coder was evaluated by computing 
the following quantities: (z) the entropy of the prediction errors of 
uncompensable pels, and the entropy of the run lengths defining the 
(tz) background pels, (ii) compensable pels, and (tv) uncompensable 
pels. The last three quantities are the addressing costs associated with 
not sending the prediction errors for certain pels. Figure 11 shows the 
overall bit rate associated with frame difference conditional replenish- 
ment encoding and motion-compensated encoding for both of our test 
sequences. Conditional replenishment encoding was done using the 
quantizer of Fig. 10 and moving area segmentation similar to that of 
the motion-compensated coder. The quality of pictures for both cases 
appeared to be approximately the same. Two characteristics of the 
motion-compensated coding are worth noting: (z) the average bits/ 
field is reduced by about 44 percent; (iz) more importantly, during the 
times when large amounts of frame difference data are generated due 
to either rapid movement or a high percentage of moving area, the 
motion-compensated coder does significantly better. This type of be- 
havior may help to decrease the occurrence of temporal overload 
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Fig. 11—Performance of basic motion-compensated coder and its comparison with a conditional replenishment coder for (a) the scene Judy and 
for (b) the scene Mike and Nadine. 


associated with most conditional replenishment coders. Part of the 
reason for such a significant reduction in bit rate due to movement 
compensation is seen from Figs. 12 and 13 which show the results for 
conditional replenishment and motion-compensated coding, respec- 
tively. In these figures, the pel intensity is proportional to the bits 
required to code the prediction error for that pel. Uncompensable pels 
form a very small portion of the moving area and their prediction error 
is also small. For the scene Mike and Nadine, the output of each 
encoder is more than twice that obtained for Judy. This increase is 
mainly due to the panning of the camera. For this scene, intended as 
a severe test for the encoders, motion-compensated encoding resulted 
in a 22-percent reduction in average encoded bits. 

Figure 14 shows a breakdown of the bits required for transmission 
of addresses and prediction error for scene Judy. It is interesting to 
note that, on the average, for the conditional replenishment coder the 
addressing bits comprise only about 15 to 30 percent of the total 
transmitted bits, but in the movement-compensated coder, since the 
magnitude of the prediction error is decreased, the addressing bits 
account for a much larger fraction of the total bits (50 to 65 percent). 
More efficient addressing methods are therefore desirable with motion 
compensation. 


3.3 Variation in coder structure 


In this section, we simplify the coder and evaluate the effect of 
various modifications on the coder performance. 


3.3.1 Coder prediction 


It is known’ that, in conditional replenishment encoders, instead of 
using a frame differential prediction, line or element difference of the 
frame difference can be used with advantage. Thus in Fig. 8, for picture 
element x;+43, the conditional replenishment coder would send 
[| FDIF (x;+3) — FDIF(x,;+2)] as an element difference of frame differ- 
ence prediction error. Similar modifications can be made to the motion- 
compensated coder. Here we transmit line or element differences of 
the displaced frame differences. Thus for picture element xj;+3, we 
transmit [DFD (x;+3, D‘t*) — pFp (x;+2, D'**)] as an element difference 
of displaced frame difference prediction error. In cases such as pel x; 
of Fig. 8, where the previous element is not in the moving area, their 
FD as well as DFD is assumed to be zero. The performance of this 
predictor modification is given in Fig. 15. As seen from this figure, 
element differencing improves both the frame differential and motion- 
compensated coders by about 5 to 15 percent. 


3.3.2 Addressing of moving area 


In motion-compensated coding, the addressing information takes up 
a large fraction of the total transmitted bits. We have therefore used 
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Fig. 12—Transmitted amplitude information for a conditional replenishment coder. Intensity of a picture element is proportional to the bits 
required to code the prediction error using frame difference prediction for four frames of scene Judy (Fig. 4). Addressing information is not included. 
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Fig. 13—Transmitted amplitude information for a motion-compensated coder. Intensity of a pel is proportional to the bits required to code the 
prediction error using the basic coder of Section 3.2 for four frames of scene Judy (Fig. 4). Addressing information is not included. 
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Fig. 14a—Components of conditional replenishment coder output. Approximately 80 percent of output is devoted to prediction error. 
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Fig. 14b—Components of motion-compensated coder output. Note that addressing information accounts for more than 50 percent of the coder’s 
output, whereas the conditional replenishment output is mostly prediction error. 
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Fig. 15—Performance with predictor modification for sequence Judy. Element difference of either frame difference or displaced frame difference 
lower the entropy. 


some known” addressing techniques and evaluated their effects on the 
total bit rate. As in Ref. 18, three techniques for addressing the moving 
area were tried. In the first technique, the beginning of each moving 
area was given an absolute address (8 bits) with respect to the begin- 
ning of the scan line, and the end of moving area cluster was given a 
special flag word different from all the prediction error codes. This is 
similar to the technique used in Ref. 2. The second technique” was to 
address the beginning of each cluster of moving area with respect to a 
similar cluster in the previous line as long as the magnitude of this 
differential address was less than +8; otherwise, the cluster was ad- 
dressed with respect to the beginning of the line. Maximum differential 
address of 8 was chosen after experiments with 2, 4, 16, and 32. The 
final technique is to address all the clusters as one-dimensional run 
lengths, as in our previous sections. The performance of these three 
addressing schemes for both the conditional replenishment and mo- 
tion-compensated coders is given in Fig. 16 for sequence Judy. As 
expected, absolute addressing requires more bits than either differen- 
tial or run-length addressing. Run-length addressing is the most effi- 
cient and decreases the overall bit rate by about 10 percent compared 
to the absolute addressing in a motion-compensated coder. 

It should be noted that, to precisely evaluate the addressing cost 
associated with motion-compensated coding, it is not adequate to use 
only the run-length entropies of the three types of segments. Knowing 
the type of the last run, the receiver must be told, in some fashion, 
which of the two possible types the next run is. This information has 
been included in the simulations of Fig. 16. Comparing the motion- 
compensated coder using run-length addressing for moving area as in 
Fig. 16 with the results shown in Fig. 11 which does not include the 
next segment type of information, we see that this information ac- 
counts for about 8 percent of the bits. 


3.3.3 Interpolation 


The motion-compensated coder discussed above uses previous field 
intensities for interpolation rather than previous frame intensities. Use 
of previous field has two advantages: (1) the displacement values are 
generally smaller, since fields occur at every 1/60th of a second, 
whereas frames occur at 1/30th of a second, and (ii) in a hardware 
coder, since only a certain number of elements from the previous field 
or frame would be made available for interpolation with the same 
number of adjacent available elements, almost twice the amount of 
motion can be accommodated. On the other hand, one disadvantage of 
using the previous field intensities is that, for a perfectly horizontal 
motion, intensities have to be obtained from the previous field by 
interpolation (due to the interlace) which introduces error in the 
computation of pFrp. Another factor, which appears to be a disadvan- 
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Fig. 16—Performance of moving area addressing schemes. Run-length addressing does better than absolute or differential addressing. 


tage but which can be overcome, is the use of an alternate field 
dropping coder mode. To relieve a buffer overflow condition caused by 
a peak in coder output, it is advantageous to drop alternate fields as is 
done in conditional replenishment coders,*” however; the previous 
field is then no longer available for motion estimation. In any case,we 
experimented with both the previous field and frame intensities and 
found that the use of previous field intensities resulted in entropies of 
unquantized moving area prediction errors which were 5 to 15 percent 
lower than those of previous frame intensities. 

Of the many interpolation techniques possible, we restricted our 
attention to linear interpolation since our goal was a coder that could 
be implemented in real time. Thus, given the four surrounding pels as 
in Fig. 17, if the displacement D is written as the sum of an integral 
part D’, and a fractional part D’ with component magnitudes D{ and 
D4, then the intensity can be interpolated by a standard two-dimen- 
sional linear interpolation as: 


=(1— D$)[(1—- Df) Ip + DiIc] 
+ Di[-—Di)b+ Di Li), (19) 


where J is the interpolated intensity. It should be noted that eq. (19) 
has been used for all simulations previously described in this paper. 
Since this interpolation formula requires a large number of time- 
consuming multiplications, we tried another approach. We first select 
the three nearest (in the sense of the Euclidean norm) neighboring 
pels out of the four (e.g., pels B, C, and D in Fig. 17). This may be 
done by rounding D to form D’, thus locating the nearest neighbor, 
pel D, which is taken to be the corner of the three, then testing the 
sign of the rounding errors to locate the other two pels. The following 
formula is then used for interpolation: 


I=Ip+ D{(Ic— Ip) + Db (Ip — I). (20) 


SCAN LINES IN 
THE SAME FIELD 





Fig. 17—Two-dimensional linear interpolation. Displacement D is decomposed into 
integral part D’ and fractional part D“. 
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Simulations using sequence Judy, comparing eqs. (19) and (20), showed 
very little differences in entropy, and since the interpolation of eq. (20) 
is significantly simpler, it is more suitable for a real-time implemen- 
tation. 

The precision with which the interpolation computation is per- 
formed was also varied. In particular, we used precisions of 2"; n = 1, 
2 +--+, 7 pel (or line) per field for displacement estimates, and found, 
as expected, that the average bits/field increased as the precision was 
decreased. The average increase in bits/field relative to a precision of 
Yi28 pel/field was as follows: negligible increase for precisions greater 
than % pel/field, 2 to 3 percent for % pel/field, 6 to 10 percent for % 
pel/field, and 15 to 25 percent for % pel/field. Thus, it appears that for 
interpolation, a resolution of % pel/field does an adequate job with 
much lower complexity compared to resolution of “2s pel/field. It 
should be noted that the recursion of eq. (18) was calculated by using 
a precision which is higher than the one used for the calculation of 
interpolation. 


3.3.4 Estimator parameters 


In eq. (18), the € was taken to be “ozs and the update term was 
clipped to (“c) pels/field. Both these quantities were varied over a 
wide range. Performance due to such variations is given in Fig. 18. 
This shows the robustness of the estimator. Of course, increase of both 
e and the clipping level to a very high value results in a noisy estimator, 
but it gives the ability to adjust quickly to rapid changes in motion. 
On the other hand, a small value of € allows us to converge to a finer 
value of displacement, thereby allowing low prediction error. A com- 
promise between these two conflicting goals appears to be the values 
of « and clipping level used in (18). 

We mentioned earlier that the recursion of eq. (18) was calculated 
at a resolution of “es pel/field. Simulation results on Judy indicate 
that, when the interpolation is calculated with resolution of % pel/ 
field, the recursion of eq. (18) does not need to be calculated at 
resolutions higher than Me pel/field, i.e, there is no significant improve- 
ment in performance by increasing the resolution beyond Me pel/field 
in the calculation of displacement. To make the simulations realistic, 
all the computations were performed using integer arithmetic (on the 
computer). 

Within our investigation into the effect of precision, the following 
simplification of the update term was simulated: 


D‘ =D"? — e-sign(prp(x;, D‘“!))-sign(VI(x;,{D*"])), (21) 


where sign of a vector quantity is the vector of signs of its components. 
The sign function, defined by (6), avoids the multiplication necessary 
for computation of the update term. Thus the update of each displace- 
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Fig. 18—-Performance change due to the variation of rate and the maximum level of displacement update in motion estimator for sequence Judy. 
Results show robustness to variation of these parameters. 


ment component from one picture element to the next consists of only 
three possibilities; 0 or te. The performance using such a simplification 
is given in Fig. 19. It is clear from this figure that the performance in 
terms of bits/field is virtually unaffected by using the simplified update 
term of eq. (21). 

Earlier we mentioned that we computed the displacement from the 
previous line of video and used it for computation of the motion- 
compensated prediction of the present line. This was done so that the 
displacement computation would have enough time to finish in a real- 
time encoder. It is possible, at least in simulations, to use the displace- 
ment of the previous moving area element in the same line for 
computation of the motion-compensated prediction of the present pel. 
This variation improves the performance of the encoder by about 6 to 
10 percent in terms of bits/field. 

The last simplification we attempted was in the calculation of the 
intensity gradient. Our hope was to use the same element and line 
differences which would be used in the computation of the DFD, so 
that this computation could be shared. Figure 20 shows three possible 
configurations of picture elements and the corresponding intensity 
gradients that were simulated. In the first and second configuration, 
the elements are those used for the interpolation of eqs. (19) and (20), 
respectively, while configuration III replaces pel D of configuration II 
with two pels further away. Coder performance using the three inter- 
polation configurations was similar, and since the second configuration 
is the simplest, it is more suitable for hardware implementation. 


3.4 Combined effects of simplifications 


We have so far evaluated the changes in coder performance due to 
each individual simplification. In this section, we evaluate the com- 
bined effects of the following simplifications: 

(z) The moving area is specified with absolute addresses and the 
next segment-type information is transmitted as in part 2 of the 
last section. 

(tt) The sign (-)-sign (-) estimator of eq. (21) is used for displace- 
ment estimation. 

(zi) The three-point interpolation of eq. (20) is used for intensity 
prediction and displacement estimation at a precision of % pel/ 
field. 

(wv) The line and element differences of interpolation eq. (20) are 
used as the intensity gradient components of eq. (21). 

As seen from the simulated performance of the coder in Fig. 21, the 
modifications have had only a minor effect on the encoded bits/field. 
The simplified basic coder represents what we believe to be a practical 
algorithm for real-time implementation. A simplified block diagram of 
this coder appears in Fig. 22. The inner loop, which comprises the 
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Fig. 19—Coder performance using a sign (-)* sign (-) motion estimator update for sequence Judy. 
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displacement estimator, attempts to match interpolated intensities of 
the last field with those of the last line by adjusting the displacement. 
The outer loop uses the displacement estimates to predict the video 
input with interpolated last field intensities. Based on frame difference, 
the segmentor limits displacement estimation (switch A) and motion- 
compensated prediction (switch B) to the moving areas. The segmentor 
also controls the selection of errors for transmission with switch C. 
Hardware for controlling the coder’s output and matching it to a 
constant rate channel has been omitted in this representation. How- 
ever, it has to be included for any real-time implementation. 


IV. DISCUSSIONS AND CONCLUSIONS 


We have developed recursive estimators in this paper and simplified 
them so that a real-time implementation would be possible. Some 
characteristics of our final estimator are that it needs computation of 
interpolation intensities at resolution no more than % pel/field (which 
can be done without multiplication). We believe that this is in the 
realm of present-day circuit speeds. Also, by the very nature of the 
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Fig. 22—A simplified block diagram of motion-compensated coder. 


recursive estimation, we do not need the assumption of uniform 
translational motion of a single object, which appears to be necessary 
for most of the heretofore known estimates. Performance of the simple 
estimator on the sequence Judy shows that a 30-to-50 percent improve- 
ment in bits/field is possible compared to frame-difference conditional 
replenishment. We also evaluated the performance of our simple 
estimator on a more difficult scene: Mike and Nadine. Both conditional 
replenishment and motion compensation generate significantly more 
data for this scene than for Judy due to the high percentage of moving 
area pels. However, the motion-compensated coding decreases the 
transmitted bits/field by approximately 22 percent. Thus the improve- 
ment is somewhat lower for this scene. Some of the modifications 
suggested in Part II’® improve the performance for this type of scene. 
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APPENDIX 


In this appendix, we show that the gradient algorithm of eq. (13) 
which attempts to minimize DFD(- , -) converges, under suitable con- 
ditions, to true displacement. As in Section II, let us assume that an 
object is moving with pure translation in the field of view of the 
camera. Neglecting the uncovered background and assuming a con- 
stant and uniform displacement per unit time, we get 


I(x, t) = I(x — D, t—7), (22) 


where D is the true displacement and 7 is the frame time. The pel- 
recursive motion estimation algorithm of eq. (13) can be written as: 


D' = D*! — eprp(x,, D')VI(x. — D*, t — 7). (23) 
Using the definition of DFD(- , -) from eq. (8), w get 
Di = D* — € {I (Xa, t) 
— I(x. — D'?, t — r)} VI(xa — D*", t — 7). (24) 
Substituting for (x2, t) from (22), 
Di=p*!- e {I (x, — D, t — 7) 
— I(xa — D*', t — r)} -VI (xa — Dé’, t — 7). (25) 
Using Taylor’s series, we can write 
I(xa — D, t-— 7) — I(x. — D™, t — 7) 
= (D'' — D)?-V1(x,. — D*!, t- 7) 
+ higher order terms in (D'' — D). 
MOTION-COMPENSATED TV CODING—I 667 


Substituting in (25), 
D=D''- e{(D'” — D)’V1(x. -D'),t- T)} VI(Xa — D'?,t-7) 
+ higher order terms in (D'! — D). 
= Di? - e{VI(x. — D*’, t — r)-VI (x. — D™, t — r)"} (D* - D) 
+ higher order terms in (D*! — D). 
Subtracting D from both sides, we get 
(Di — D) = (DH — D) — e{VI(x. — D*, t— 7) 
V1 (xe — D, t— 7)7} (D — D) 
+ higher order terms in (D’! — D). 
Neglecting higher order terms,* for small (D'™! — D), 
(D‘ — D) 
= [J — €{VI (xe — DO, t— 7)-VI (xa — D7, t— 7) 7} — D) 


where J is an identity matrix of appropriate size. We now take 
statistical averages of both sides, assume statistical independence of 
the two right-hand terms,; and then apply Schwartz inequality to get 


| D'- D| 
< || J — e{VI(x. — D'', t-— 1) VI (x. — D'', t—7)7} | 
.||D- DI (26) 

where a bar on top denotes a statistical average. This can be written 
as: 

| D' — Dl] <|[1 — eAma|-|D°? - DI, (27) 
where Amax is the maximum eigenvalue of the positive semidefinite, 
symmetric matrix VI(x. — D'', t — 7)VI(x. — D*!, t — 1)”. For 
convergence of the algorithm, we need 

| 1 — €Amax| < 1, 


1.e., 





>e>0. (28) 
Since the maximum eigenvalue is hard to compute, and since it is 
upper bounded by the trace, we get 


* This may be a severe assumption in certain cases. In such cases, the ability of the 
algorithm to converge may depend upon the goodness of the initial estimate D. 

+ This is traditionally done in stochastic gradient algorithms. See, for example, Refs. 
19 to 21. 
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2 


ne . 9 
tr[VI (xa — D'?, t-—7)VI&. —D*", =F le e> 0 (29) 


But since 


tr[VI(xa — DX}, t — 7) VI (xa — D™, t — 7)7] 


EDIF? + LDIF?, 


M = 


ee 
= 7, 


1 


we get the following condition for convergence 


2N 
>e> 0, (30) 


Ne 
> EDIF; + LDIF? 


i=] 


where the summations are carried over the entire moving area con- 
taining WN pels. 
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Multiqueue Systems 
with Nonexhaustive Cyclic Service 


By P. J. KUEHN 
(Manuscript received July 14, 1978) 


Queuing models with cyclic-type service are applicable for perform- 
ance studies of polling mechanisms in data communication and 
switching systems or cyclic scheduling algorithms in real time com- 
puters. This paper provides an approximate analysis of the multi- 
queue system M'*!/G/1 with batch Poisson input, general service 
times, general overhead (switchover) times, and a single server op- 
erating under a cyclic strategy with nonexhaustive service of queues. 
Based on a new concept of conditional cycle times, the generating 
function of the stationary probabilities of state, the Laplace-Stieltjes 
transforms of the delay distributions, and the mean waiting times are 
derived explicitly for each queue through an imbedded Markov chain 
approach and an independence assumption. The approximate ana- 
lytic results are validated by computer simulations. Besides this 
analysis, a stability criterion is derived for the general case of GI/G/ 
1 systems with cyclic priority service. The paper concludes with a 
number of studies of the behavior of cyclic queues discovering inter- 
esting properties such as the dependence of cycle times and waiting 
times on the arrival and service process types and on the efficiency of 
cyclic priorities. 


|. INTRODUCTION ; 

Cyclic service is a frequently used mechanism for the information 
transfer between peripheral units and their centralized control opposed 
to asynchronous or synchronous interrupt mechanisms. In a cyclic 
service operation, the centralized control scans the peripheral units in 
a cyclic sequence. At each peripheral unit, the queue of waiting items 
(user or control data) is served either completely (‘exhaustive service’) 
or up to a specified maximum number of transferred items per scan 
(“nonexhaustive service’) until the centralized device switches over to 
the succeeding unit within the cycle sequence. Examples of this type 
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of operation are found in data communications systems (polling, asyn- 
chronous multiplexing), telephone switching systems (device scan- 
ning), and certain [/O mechanisms of real-time computers. The per- 
formance of these cyclic service mechanisms 1s of considerable interest 
for traffic engineering, namely with respect to throughput and resource 
utilizations, delays, unbalanced load, overload behavior, and the influ- 
ence of various statistical properties of the traffic. . 

In the sequel, we refer to the general cyclic queuing model shown in 
Fig. 1. There are g arrival groups of “customers” and their correspond- 
ing waiting lines (queues). Customers of group / arrive according to a 
general independent (GJ) arrival process with probability distribution 
function (pdf) A;(t) = P{T4; = t}, where T'4; denotes the random 
variable of the interarrival time in queue /, 7 = 1, 2, ---, g, and A; = 1/ 
ET.,,; defines the arrival rate of customers in queue /. Special cases of 
the GI arrival processes are: D (deterministic), M (Markovian), Ex 
(Erlangian order k), or Hz (hyperexponential order 2). In case of batch 
arrivals, the arrival process is defined by both the random interarrival 
time 73; of batches and the random batch size K;, 7 = 1, 2, ---, g. The 
batch size in queue / is given by its probability distribution qj, = P{K; 
= k}, k = 0,1, ---. The total arrival rate of j-customers A; and the 
arrival rate of batches Aj; are related to each other through A; = Ag;- 
EK,;. For the special case of deterministic arrival processes in more 
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Fig. 1—Cyclic queuing model. 
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than one queue, a “phase shift parameter” is additionally needed 
which describes the relation between the periodic arrival patterns in 
those queues. 

Similarly, customers of queue / receive a random service time Tx; 
with pdf H;(¢) = P{ Tx, s t} and mean A; = ETy,,7 = 1, 2, ---, g. Once 
the server has finished service at a particular queue /, it switches to 
the succeeding queue in a finite switchover (overhead) time Ty; with 
pdf U;(#) = P{Tvu; Ss t} and mean u; = ETy;, 7 = 1, 2, «++, g. 

Finally, the general nonexhaustive cyclic operation of the server 
may be specified by a sequence (“cycle”) {%, t2, «++, Wz}, where z € (1, 
2, -++, g) denotes the number of that queue which is served in kth 
position within the cycle (J = cycle length). The sequence {1t1, iz, «+>, 
t1} is repeated in a cyclic manner. If there is no customer to serve from 
the z,th queue, the server switches over to the 1,4;st queue (modulo J). 
An example of this general (mixed) cyclic sequence for g = 3 queues 1s 
{1, 2, 1, 3, 1, 2}, where / = 6. In this case, an overhead phase is inserted 
after every queue visit. Cyclic schedules with different frequencies of 
visits at the various queues within a cycle will also be referred to as 
“cyclic priority service.” An important special case of cyclic priority 
service is obtained when all visits at a particular queue within a cycle 
are clustered such that the server attends queue 1 successively up to 
l, times, queue 2 successively up to /2 times, and so on (J=4) ++ - 

- + J,). In this case, an overhead occurs only when changing to 
another queue. Limiting cases of this schedule are cycles with J; = 1,7 
= 1,2, ---, g (“ordinary cyclic service” {1, 2, -»-, g}) or cycles with J; 
> 1,7 =1, 2, ---, g (“exhaustive cyclic service”). The queuing analysis 
in this paper is limited to the practical important case of ordinary 
cyclic service; for stability and simulation studies the more general 
(nonmixed) cyclic priority service will be considered. 

Queues with cyclic service have received considerable attention in 
literature (see Refs. 1 to 16). Cyclic queues with exhaustive service 
with or without overhead have been treated in case of M/G/1 models’ ® 
and in case of discrete arrival and service processes.” The case of 
nonexhaustive cyclic service involves considerable mathematical dif- 
ficulties and has been treated rigorously only for M/G/1 models with 
two queues without overhead.'"’ Because of the mathematical intract- 
ability of most cyclic queuing problems, several approximate methods 
were suggested.’*"'® The approximate methods usually rest on some 
simplifying assumptions such as the “independence assumption’” 
under which the stochastic processes within a particular queue are 
considered more or less independent of the processes within the other 
queues. 

In Section II of this paper, we first derive a stability criterion for 
queues of the type GI/G/1 with cyclic priority service. Section III 
deals with the cycle time analysis for GI/G/1 queues in case of ordinary 
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cyclic service. Sections IV and V present an analysis of the probabilities 
of state and the waiting times for cyclic queuing models of the types 
M/G/1 and M!1/G/1 with ordinary cyclic service and general over- 
head, respectively. In Section VI finally, we report various numerical 
results of the approximate analysis and of computer simulations for 
validation and qualitative performance studies. Some of those results 
discover new insight into the properties of cyclic queues and could 
have direct consequences for system engineering and future research 
as well. 


ll. STABILITY OF CYCLIC QUEUES 


Contrary to most standard queuing problems, an obvious and simple 
criterion does not exist under which a queue in a cyclic queuing system 
stays stable. In the following sections, we develop a stability criterion 
for GI/G/1 multiqueue systems with a (nonmixed) cyclic priority 
service. 


2.1 A Stability criterion for queues with cyclic priority service 


Following analogously to a common definition for stability in system 
theory, a queuing system will be called “stable” if for positive service 
times and finite input rates the average queue lengths are limited (note 
that a stationary queue is stable, whereas a stable queue need not 
necessarily be stationary). Additionally, we assume that all arrival and 
service processes are stationary so that the following reasoning can be 
based on average values independent of specific distributional assump- 
tions. 

Let 7c be the random cycle time, c = ETc the average cycle time, 
and Co = U; + Up + +--+ + Ug the average of the cycle time under the 
condition that no customer is served during a cycle. The average 
number of arriving 7 customers during a cycle 1s n; = A; c. In the 
stationary state of the system, the average number of arriving / 
customers equals the average number of served j/ customers, j = 1, 2, 

--, g. Thus, we have 


& 
c= cot ) (Aye) hy, 
J=1 


from which we find the result 


Co 


(1) 





C= ) 
1 — po 


where po = pi + p2 + +++ + pz defines the total server utilization and 
p; = A;h; is the server utilization by 7 customers only, j = 1, 2, ---, g. 
The result according to (1) has already been discovered for cyclic 
queues with exhaustive and ordinary cyclic service. To find the bound- 
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ary of system stability, we proceed as follows: First, we state a stability 
criterion for a particular queue J under the condition of stability of the 
residual queues. This condition can always be achieved for sufficiently 
small arrival rates in the residual queues. The whole system is stable 
if and only if all individual stability conditions are satisfied simulta- 
neously. 

Under the condition that all queues »y ¥ 7 are stable, queue / 
approaches the stability boundary as n; — J;; this corresponds to a 
maximum arrival rate A; max at the margin n; = J; and an average cycle 
length c;}: 


i 


Co + Lh; 
Nj etas = oe? where cH = hited 


——_—_——_——_. (2a 
G 1 — po + p; 


Thus, the system is stable if for all queues 


i; 


Aj < Aj max =———— 
' q Co + Gh; 


(l—potpy), J=1,2,+-+,g, (2b) 
are fulfilled simultaneously. In a similar way, criteria of partial stability 
can be stated in cases where some queues are saturated (a saturated 
queue 2 contributes to the average cycle time by J];-A;). Finally, it 
should be noted that the average cycle time c stays always stable since 
Cc=cot hh, + se. + Lhg. 


2.2 Examples 


To further explore the stability criterion, consider the example of 
g = 2 queues. From (2b) we find the following relationships between 
A; and Xa: 

i ree, oe tb (1—AiAi), (8a, b) 
1 Co + hhy 2ft2), 2 oo Bhs 1/el), ’ 
where Co = UW + U2. These relationships are shown graphically by two 


marginal lines in Fig. 2 with the intersection 
lj 


——— 
i Co + Lh, + bho 


J =1, 2. (4) 
The absolute stable region is below the hatched area when both 
individual criteria (3a), (3b) are fulfilled simultaneously. For A; < Ai 
queue 2 always saturates first, whereas for \2 < Az. queue 1 saturates 
first. At the intersection (Aio, A20), both queues saturate simultane- 
ously. Similarly, for A; > Aico, A2 > Azo, both queues are saturated 
(absolute unstable region). Within the intermediate regions A, > /,-(1 
— Azh2)/(co + hihi), A2 < Ago (or Az > le (1 — Arh) /(Co + Lhe), A1 < 
Aio), queue 1 (or queue 2) is saturated, whereas queue 2 (or queue 1) 
is stable; in these regions of partial stability, the cyclic queuing system 
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Fig. 2—Stability regions for a GI/G/1 queuing system with 2 queues and cyclic 
priority service. 


can be considered as consisting only from the stable queue where the 
contribution of the unstable queue to the cycle time affects as an 
increased cycle time overhead of J; (or /2) consecutive service times 
Tm (or Tx2). Furthermore, starting from any point within the absolute 
stable region and increasing A; and A2 simultaneously, we state that 
the queue with the greater A,/l; ratio reaches saturation first, inde- 
pendent of the service and overhead time parameters. This statement 
differs from many other queuing stability criteria. 

Finally, we discuss briefly two important special cases of the above 
example. The first special case is that of ordinary cyclic service (l, = 
lo = 1). The intersection in Fig. 2 falls on the median A, = Az. This 
means that the queue with the greater arrival rate always saturates 
first. This result was already found by M. Eisenberg"! for M/G/1 
systems without overhead. Additionally, the average number of / 
customers served during a cycle is identical with the probability a; 
that the server meets at least one customer in queue /: 


Co 


—-);, Jjs=1,2. (5) 
1— Po 
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The second special case is that of exhaustive cyclic service (i), l2 >> 1). 
In this case, both marginal lines fall together on A; hi + Azh2 = 1. The 
stability criterion is simply 4,41 + Azh2 = pi + p2 = po < 1. Approaching 
the stability margin, both queues saturate simultaneously independent 
of Co. 


lll. CYCLE TIME ANALYSIS 


In this section, we consider multiqueue systems of the type GI/G/1 
with overhead and ordinary cyclic service. Since the queuing analysis 
in Sections IV and V is based on the knowledge of cycle time, we 
briefly discuss a known result and then develop an improved approx!- 
mate solution for the pdf of the cycle time. 


3.1 Cycle time analysis by Hashida and Ohara 


The exact solution for the pdf of the cycle time Tc is still unknown, 
except for the mean c in (1). Based on the probabilities a; in (5) and 
the approximation assumption of independence, Hashida and Ohara” 
gave the following expression for the Laplace-Stieltjes transform (LST) 
of the cycle time pdf C(t) = P{Tc = @}: 


g a 
oc(s) = I] gui(s)-]] (aipui(s) + [1 — ai). (6) 


In (6), dc(s) = fo- e-“dC(t) defines the Lst of C(t); similarly, dui(s) 
and gui(s) denote the LsTts of U,(t) and H(t), respectively. The 
expression (6) follows directly when considering Tc as a sum of 
independent random variables. 

From (6), the exact mean cycle time c follows straightforwardly and 
_ agrees with (1). However, it was found by intensive simulations (some 
of them are given in Section VI) that (6) underestimates the cycle time 
variance and, herewith, also the mean waiting times. For this reason, 
we shall now improve the cycle time analysis by introduction of a new 
concept of “conditional cycle times.” 


3.2 Conditional cycle times 


The basic idea of the queuing analysis in Ref. 13 and in Sections IV 
and V of this paper is the description of the queue length of a particular 
queue 7 at the scan instant by an imbedded Markov chain. The 
influence of all queues v #7 on the considered queue / will be expressed 
only through the cycle time. The cycle time J¢ is the time interval 
between two successive scan instants of a queue (say, 7). However, a 
particular realization of Tc clearly depends on whether a / customer is 
served or not in a cycle. Therefore, we introduce two conditional cycle 
times 7'c, and T'c, with respect to the considered queue / for cycles 
without or with a service time contribution to the cycle by a customer 
of queue /, respectively. The corresponding cycles are denoted by C; 


MULTIQUEUE SYSTEMS 677 


and C;’. Since ETc, > ETc, it is more likely that after a long cycle 
another long one is followed, and vice versa. Thus, the concept of 
conditional cycle times increases the cycle time variance through a 
reduction of the independence assumption by explicit consideration of 
some history of a current cycle. 

Let Cj(¢) and C;’(¢) be the pdfs of the conditional cycle times Tc, 
and Tc;, and aj, aj; the corresponding probabilities for the service of 
an 2 customer (1 # 7) during a conditional cycle without or with a/ 
service, respectively. Then, it follows by similar reasoning as for (6): 


g 
oc(s) = I] dui(s)-[] (ajpui(s) + [1 — aj), (7a) 


& 


oc;(s) = I] oui(s) TT (aj bui(s) + [1 — aj ])-bx;(S). (7b) 


i= 


For the (unconditional) cycle time 7c we find from the law of total 
probabilities 


oc(s) = (1 — a)be(s) + ajbc;(s). (8) 


With c; = ETc and c;’ = ETc; we state the conditional cycle time 
balances: 


cj =Cot ¥ ahi, (9a) 
inj 

cy =cot ¥ ajih + hy, (9b) 
Ih] 

c= (l—a,)cj t+ ajc. (9c) 


Similarly, as in (5), we assume 
Aji = AiCj, (9d) 
aji = Aic}, Lx J. (9e) 
Inserting (9d), (9e) in (9a), (9b), we find 


Co 
cj =——___., (10a) 
"1 po + 9; 
+ h; 
1 — po + pj 


Note that the exact value of c in (1) follows from (9c) and (10a), (10b). 
It should also be mentioned that the solution (10a), (10b) holds only 
as long as aj; = 1. This condition is always fulfilled in case of symmet- 
rical load (A; = Az = +++ = Ag, ki = ho = --- = hg). In case of higher 
unsymmetrical loads, it can indeed happen that a;; > 1 so that af can 
no longer be interpreted as probability; this difficulty can be overcome 
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by a suitable limitation of aj; by 1 (1.e., queue z always contributes a 
service time to the conditional cycle time Tc;). 

With (9d), (9e) and (10a), (10b), the conditional cycle time pdfs C;(¢) 
and C;’(t) are completely defined. The mean conditional cycle times 
c; and c;’ are given by (10a), (10b). For the variances, we find from 
(7a), (7b): 


& 
VAR Tc, = Y, VAR Tui + Y (ajih!? — aji-hi), (11a) 
i=] iy] 


& 
VAR Tc; = ¥ VAR Tui + Y (apih?? — aji?+hi) + VAR Ty, (11b) 
i=1 inj 


where fh}? = ET%; denotes the ordinary second moment of T'y;. The 
cycle time variance is finally given by 


VAR Tc = (1 — ay) -[VAR Tc, + €;7] + a-[VAR Tc; + c/*]—c*.  (11c) 


The second moments of the cycle times follow from the definition 
VAR Tc = c® — ce’. 


IV. QUEUING ANALYSIS OF M/G/1 SYSTEMS 


Based on the concept of the conditional cycle times of Section III, 
a queuing analysis is given for multiqueue systems of the type M/G/1 
with general overhead times and ordinary cyclic service by means of 
an imbedded Markov chain. Basically, the derivation follows the 
approach of Hashida and Ohara.” 


4.1 Probabilities of state 


For an exact analysis, the state of the system at a time ¢ has to be 
defined such that all past history is summarized in it so that the future 
development of the system state process is completely determined 
from it. In the present case, the system state could be described by a 
vector {Ni(t), No(t), ---, Nz(t), 1(t), Xo(t)}, where N,;(¢) defines the 
number of waiting customers in queue j, ] = 1, 2, --- , g, I(t) points to 
the present location of the server within the cycle, and Xo(¢) specifies 
the age of the current service (or overhead) phase of the server. An 
exact analysis on this base seems not to be feasible. 

In the following analysis, only the state N, of a particular queue 7 is 
considered. Moreover, the analysis does not apply to continuous time 
but is restricted to a set of special points, namely the scan instants (or 
departure instants) of the considered queue j. The time intervals 
between the scan instants of queue / are the conditional cycle times 
Lc; and Tc;; the influence of all the other queues on the queue length 
process in queue / is completely expressed by those cycle times. 
Although the following imbedded Markov chain solution is formally 
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exact, the analysis approach is approximate since Tc, Tc’, are 
assumed to be independent and identically distributed (iid) variables; 
the expressions for their pdf are only approximations, too. 

The outlined method only renders results for the particular queue 7 
under consideration. In the case of unsymmetrical systems, the pro- 
cedure must be repeated for the other queues, too. For ease of reading, 
we suppress the subscript 7 in the following treatment, 1.e., we write 
A, h, p, c’, c”, «++ instead of A;, hj, pj, Cj, C7 ++ 


4.1.1 State distribution at scan instants 


We assume that the queuing sytem is in the stationary state. Let NV 
be the number of waiting customers at the server arrival instant (scan 
instant) of a particular queue. We are interested in the stationary 
distribution 


Pr=P{N=n}, n=0,1,2,-+-. (12) 


Because of the memoryless property of the arrival process, the system 
state of the considered queue forms an imbedded Markov chain at the 
discrete set of scan instants (renewal points). The stationary distribu- 
tion satisfies the equation (see Ref. 17, pp. 167-174): 

n+1 


Pn = Po*Pon + 3 Pm*Pmn, n=1,2,---, (13a) 
m=] 


where the transition probabilities p,,, are given by 


% Nt)? 
| e~™ oe ee - dC” (t), m>0 
t=0- 





(n —m+ 1)! 
Pai = (13b) 
. At)” 
| es mw) - aC’ (t), m= 0. 
oie ni 


Together with the normalizing condition 


> Prez I, (13c) 
n=0 


the stationary probabilities of state at the scan instants are completely 
determined by the set of equations (13a), (13b), and (13c). Introducing 
the probability generating function of the state distribution p,, n = 0, 
1 De einen. 


G(x) = ¥ pnx", (14) 


n=0 


680 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


we obtain after some algebraic manipulations 


. xpco'(z) — oc°(Z) 
x — oc"(Z) 


Note that G(x) is completely expressed by po and the LsTs of the two 
conditional cycle times. Using the identity G(1) = 1, we find from (15) 
through evaluation of lim, G(x) by L’Hospital’s rule 

1 — Ac” 


= ——____—_-= ] - a. 1 
oP 1—A(c” —c’) ° a8) 


G(x) = Po ; where z=A(1-~<x). (15) 


The latter identity can be shown by using equations (10a), (10b), and 
(5). } 

The expected number of waiting customers at the scan instant 
follows from 


d 
EN = ae G(x) 


x=1 
This results in 


Ac’. (1 — Ae”) + ce’ A720" + 2 — 2Xe”) 


=r oF 
7 21 — are”)? 


(17) 


where c’?) = ET?, and c” = ET?2-. 


4.1.2 State distribution at departure instants 


Let N* be the number of waiting customers within the considered 
queue which are left behind by a departing customer of that queue 
~ with distribution 


pi = P{N* =n}, n=0,1,2,--- (18) 
and generating function 


G*(x) = D phx”. (19) 


n=0 
The probability p7 can be expressed through the probability of having 
m customers at the scan instant given that the considered queue is not 


empty, Dm/(1 — po), and the probability of n — m + 1 new arrivals in 
that queue during the subsequent service time of one customer. Hence, 


n+1 a n—m+1 
* __ Pm —Nt (Af) 
pr= % . . (n —m + 1)! eon 





n=0,1,2,---. (20) 


MULTIQUEVE SYSTEMS 681 


Substituting (20) in (19) and interchanging the order of summation 
and integration, we find 


G*(x) = at =. gx(z), 3 (21) 


where 

z=XA(1—~x). 
Therefore, it follows for the expected number of customers at the 
departure epoch: 


ENe “GG ies, (22) 
dx a 1- Po 

On the other hand, EN* equals the expected number of customers 
which have arrived during the sojourn (waiting + service) time of the 
departing customer (for this, consider as an example the queue disci- 
pline FcFs, first come, first-served). Hence, EN* = A-(w + h), where 
w = ETw denotes the average waiting time in the considered queue 
(service being excluded). Solving for w, we find with (22) 


w=5- (| -1}. (23) 





4,2 Delay analysis 


For the following derivation, the queue discipline FCFS is assumed. 
Let Tw be the waiting time which an arbitrary customer of the 
considered queue (in the following denoted by “test customer’’) has to 
undergo with pdf W(t) and LsT ¢w(s). Through an analogous reason- 
ing as in the previous section, pz; can alternatively be considered as 
the distribution of the number of arriving customers during the sojourn 
time T's of the test customer. Since T's = Tw + Ty and since Tw and 
Tx are independent of each other, the pdf of Ts is the convolution of 
W(t) and H(t), symbolized by W(t) © H(t). Hence, 


pt= | en AO awn) © H(t)), n=0,1,2,---. (24) 
t=0— : 


Applying (19) in (24), we find G*(x) = ow(z)-¢n(z), where z = A(1 
— x), which, with (21), finally results in 


1—Ac” 1 — oc: (s) 


= ee 25 
ow(s) a Sol oe (25) 
From (25) we find for the mean waiting time 
d ce’) Ac” (2) 
rrr Sd ee Cos 2) 
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Equation (26) reveals that the mean waiting time depends basically on 
the first and second moments of the conditional cycle times. Note also 
that (26) agrees with (23) when the corresponding results for po, EN, 
c’,c”, c’™, and c”™ from eqs. (16), (17), (10a), (10b), and (11a), (11b), 
respectively, are inserted. 

It may be mentioned that the result (26) can also be derived directly 
through the application of renewal theory and Little’s law: An arriving 
test customer of the considered queue meets either a cycle C’ or C” in 
progress. Since the arrival process is a Markovian process, the proba- 
bilities of meeting a cycle C’ or C” is simply the weighted ratio of 
frequencies, 1.e. (1 — a)-(c’/c) ora: (c”/c), respectively. According to 
our approximation assumption, the conditional cycle times Tc and Tc- 
are iid-variables. Thus, the average residual cycle times are c’”/2c’ 
and c”’/2c” according to renewal theory (see, for example, Ref. 17, 
pp. 158-161). The average waiting time w consists of the average 
residual cycle time and the product of the mean cycle time c” and the 
average number L of customers met at the arrival instant of the test 
customer; the latter one can be expressed through Little’s law (see, for 
example, Ref. 17, pp. 156-158) through L = A-w. The average waiting 
time w can now be balanced as 

, 7(2) ” (2) 


peg a 
c 2e c 2¢ 





+ (Aw)c”. (27) 


Solving for w, we yield precisely the result (26) from (27). 

The pdf W(t) can be obtained by the inversion of (25) either through 
a partial fraction expansion (in case of rational LsTs), by the numerical 
inversion technique of D. Jagerman,’® or by an approximation using 
the ordinary first and second moments.”” 

Finally, we mention that (25) includes the exact result for the 
limiting case g = 1 of a single cyclic queue with overhead. Furthermore, 
another limit with zero overhead can be derived from (25); this case 
represents the worst case with respect to the approximation accuracy 
(see Section VI). 


V. QUEUING ANALYSIS OF M'“1/G/1 SYSTEMS 


In this section, the solution of Section IV for single Poisson arrivals 
(M) is generalized to batch Poisson arrivals (M'!) in every queue. 
The analysis follows analogously to Section IV; i.e., we consider all 
processes with respect to a particular queue /j. Again, the subscript 7 
will be suppressed for ease of reading. 


5.1 Probabilities of state 
5.1.1 Arrival process 
Customers of the considered queue arrive in batches of size K with 


distribution 
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gz = P{K =k}, k=0, 1, 2,--- (28a) 
and probability generating function 


Q(x) = 2 qex®. (28b) 


The interarrival times of batches are exponentially distributed with 
mean 1/Ag = EK/A, where A and Az, are the arrival rates of customers 
and batches within the considered queue, respectively. 

Let Ng(¢) be the number of batch arrival instants in (0, t) with 
distribution 


P{Na(t) =n} nn -eF =n = 0,1,2,--- (29a) 


and probability generating function 


co r t n 
g(x, t) = YX Sa . est, xn" = esBt—x) | (29b) 
n=0 * 

Finally, let Na(t) be the total number of customers arriving at the 
considered queue in (0, ¢). Then, the probability generating function 


of the distribution P{Na(t) = k}, k = 0, 1, 2, --- , is given by 


iv] 


h(x, t)= Yo P{Na(t) =k} + x*® = g(Q(x), t) = e?2-@1_ (30) 


k=0 


5.1.2 State distribution at scan instants 

Since the cycle time approximation of Section III holds for GI/G/1 
cyclic queues, the same pdfs C(t), C’(é), and C”(¢) can be used for 
batch arrival processes. As in Section 4.1., let p, be the stationary 
probability of state for nm customers waiting within the considered 
queue at the scan instants. The transition probabilities pn, in (13a) 
are now 


a 4) 


. P{Np(t) = v} 


t=o- "" 
.P{Na(t) = n—m-+1|Na(t)=7}-dC"(t),  m>0 
Pmn = (31) 
| S P{Na(t) =r} 


=j— »=0 
»-P{Na(t) = n| Na(t) = v}- dC’ (6), m= 0. 
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The probabilities of state p, are completely determined by (13a), (31), 
and (13c). The application of the generating function results finally in 
the same expression for G(x) as in (15), however, with z = Ag-[1 — 
Q(x) ], where Q(x) is defined by (28a), (28b). Also, for po the identical 
result is obtained as in (16). Further results can easily be derived 
analogously as in Section 4.1. | 


5.1.3 State distribution at departure instants 
Using the same definitions for pz and G* (x) as in Section 4.1.2, we 
find, instead of (20), 


jie y P{Na(t) =v} - P{Na(t) 


m=1 1 — Po t=-0— v=0 
=n—-m+1|Na(t) =v}dH(t), n=0,1,2,---. (82) 


This again results in the same expression for G*(x) as in (21) with z 
= Ag-[1 — Q(x) ], from which further results could be derived analo- 
gously. | 





5.2 Delay analysis 


- Following the method outlined in Secton 4.2, p* is also the distri- 
bution of the number of arriving customers during the sojourn time of 
a test customer of that queue. The number N* of customers left behind 
in the considered queue by the departing test customer is now built up 
from two components: 


N* =Ni+t+WN3, 
where 


Ny =the number of customers that had arrived together with the 
test customer in one batch but that were behind the test 
customer 


Nz =the number of customers that had arrived in subsequently 
arriving batches during the sojourn time T's of the test cus- 
tomer. 


Let r, = P{N¥ =n}, n = 0, 1, 2, ---, be the probability that the 
departing test customer leaves n customers behind which had arrived 
together with the test customer in one batch. The test customer 
arrived in a batch of size K = k with probability (see Ref. 20) 


hgn 
EK’ 
The test customer is first, second, --- , Ath in the batch of size k with 
probability 1/k. Thus, q?/k defines the probability that the test 


k=1,2,3,---. (33) 


qi = 
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customer arrived in a batch of size k in (k — n)th position, n = 0, 1, 2, 
---,k—1. Then, | 


y dk, n= Q, 1, Z, eee (34) 


and probability generating function 
1 1-Q(x) 
R — n Le = —_—_—_——- 3 ‘ 


Now, we can establish the relation between pr; and Ts analogously as 
in Section 4.2: 


(35) 


pt=| | 3 pavt=w)-3 Pent (y =n -n1 No =7) 


p=0 


PUNa(t) =») |-d0me ® H(t)), n=0,1,2,---. (36) 


In (36), the bracket term expresses the probability of new arrivals 
within a sojourn time of length ¢ through consideration of all principal 
possibilities of batch configurations of the departing test customer. 
Introducing r, and R(x) from (34) and (35) and applying (19) on 
(36), we find after some intermediate calculations 
G* (x) = ow(z)-dn(z)- R(x), (37) 


where 


z=Az[1 — Q(x) ]. 


Equating both expressions in (21) and (37) yields the final result 


_1—d’c” 1- $c-(s) 1 
ow(s) eae Fm ES TT, (38a) 


where x = f(s) the solution of 
s = Ap{1 — Q(x)]. (38b) 


From (38a), (388b), we find for the mean waiting time of a customer 


c’) Ac”? ce” ER? 
= ( — + —______} + —______.. | -—__-- 1]. 39 
- oe 2(1 - os! 2(1 — Ac”) if 2) 
Note that the mean waiting time consists of two terms; the first term 
is identical with that of an M/G/1 system [see (26)], whereas the 
second term expresses the influence of batch arrivals. 


Analogously to Section 4.2, the mean waiting time can be derived 
directly. Let w(z) be the mean conditional waiting time of a customer 
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who is Uth in his batch. For w(1), a similar balance can be stated as in 
(27): 
, 1(2) 


w(l) = (1 a) 


” (2) 
Cc 
Gg ae hap el” (40a) 


€ 2c"” 
The relationship between w(1) and w(t) is 
w(t) = w(1)+(i-1)-c’. (40b) 


The mean waiting time w, irrespective of the test customer’s position 
within the batch, follows by averaging over the conditional waiting 
times. Thus, with (33) and (40b), we have 





“ kqz 1 © c” | EK* 
—| ————- « — » = 1 + ar ae aa 1 . 4l 
Ub oR pe  e Ee ate) 
Inserting (41a) in (40a) and solving for w(1) yields 
¢’ ?) Ae” (2) Ae”) EK? 
1) = — + ————_ + —___-- | ——— - 1 | . 41b 
WN  oek Bane) Ane Be oy 


The mean waiting time w is completely determined with (41a), (41b) 
and agrees with (39). 

Finally, we give the explicit results for w in the case of two special 
batch size distributions. For constant batch size k, 1.e., qi = 5(1, k), we 
find 

k-1 c” 


2 Tne" oe 


w| MxVG/i =wi|ujcon + 
In the case of geometrically distributed batch sizes, which are defined 
by gi=q'-(1—q),t=0,1,---, and g = (EK + 1)/EK, the result is 


" 


6 


es ae (42b) 


w| mixyon = w| ayer + EK: 


The expressions (42a), (42b) demonstrate at first the increase of the 
waiting time through the batch Poisson arrival process compared to 
the pure Poisson arrival process and, second, the increase of w through 
geometrically distributed batches against constant batches. 


VI. NUMERICAL RESULTS 


In this section, the results of the approximate analysis are validated 
by computer simulations. Further results are given to show various 
properties of cyclic queuing systems. 


6.1 Cycle time variance for ordinary cyclic service 


Since the mean cycle time c according to (1) is always exact, the 
approximation accuracy can be judged in a first step by the cycle time 
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variance VAR Tc (note that even the pdf C(¢) of the cycle time would 
not be sufficient for a complete validation since successive cycle times 
are not independent of each other; for a more complete validation, 
some covariance measure should be considered, too). We expect very 
good accuracy for low traffic (since the independence assumption is 
asymptotically exact for zero arrival rates) as well as for heavy traffic 
(since each of the queues contributes in the limit with a full service 
time to the cycle so that the cycle times become independent of each 
other again). 


10 


NEW THEORY 
——— HASHIDA/OHARA 
; SIMULATION RESULT 


WITH 95% CONFIDENCE 
LEVELS 


NO 


CYCLE TIME VARIANCE (VAR T,) 


oO 
on 


0.2 





0 0.2 0.4 0.6 0.8 1.0 
SERVER UTILIZATION (po) 


Fig. 3—-Accuracy of cycle time variances 
Parameters: g = 2 symmetrical queues M/D/1 and M/H2/1 
cu = 2.0 coefficient of variation of service times for M/H2/1 
h, = h2 = 1 average service times 
Co = 0.2 and 1.0, constant overhead. 
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Table |—Cycle time variance for various G//M/1 queues with 
ordinary cyclic service 


Parameters: g = 10 symmetrical queues; A, = 1, u; = 0.1 (0.5), A; = A, 7 = 1, 2, «++ , 10; 
Constant overhead times; D/M/1: Equal phase shift between arrival instants; H2/M/1: 
Interarrival time coefficient of variation c, = 2.0; M'*!/M/1: Constant batch size k = 4. 


Parameters VAR T¢ (simulation) ote 
Po co D/M/1 E./M/1) M/M/1 H2/M/1 M“7M/1 GI/M/1 
0 1.0 0 0 0 0 0 0 
0.4 1.0 1.83 1.88 1.91 1.88 1.66 1.38 
0.6 1.0 6.60 5.38 5.14 5.03 4.61 3.25 
0.8 1.0 21.90 15.40 14.50 12.90 11.00 9,20 
0.909 1.0 10.00 10.00 10.00 10.00 10.00 10.00 
0 5.0 0 0 0 0 0 0 
0.2 5.0 2.23 2.77 2.78 2.81 2.64 2.38 
0.4 5.0 11.10 7.85 7.59 7.28 6.66 5.80 
0.6 5.0 21.70 13.00 11.30 10.70 10.40 10.30 
0.667 5.0 10.00 10.00 10.00 10.00 10.00 10.00 


In Fig. 3, VAR Tc is shown dependent on the server utilization po in 
case of g = 2 symmetrical queues of the types M/D/1 and M/H,/1, 
each with two cases of constant overhead. As expected, the cycle time 
variance depends largely on the pdfs of the service and overhead times. 
The solid curves of the new approximation with the concept of condi- 
tional cycle times compare generally better with the simulation than 
the previous theory by Hashida and Ohara,”” especially for low over- 
head. From a large number of computer simulations for M/G/1 sys- 
tems, we made the following qualitative observations: 

(zt) The cycle time variance accuracy decreases with increasing 

number of queues and increasing service time variance. 

(7) The cycle time variance accuracy increases with increasing 
overhead and for approaching the low or heavy traffic region. 

(iii) Observations (i) and (ii) apply to the new and old theory; the 
concept of conditional cycle times, however, yields generally a 
better accuracy. 

Since the approximation for the pdf of the cycle time is independent 
of the arrival process type, it is interesting to know how the actual 
cycle time variance depends on various process types. For comparison, 
five different GI/M/1 systems with g = 10 queues (the accuracy is 
generally better for g < 10), two cases of overhead, and five cases of 
load have been considered (see Table I). Summarizing, we make the 
following observations: 

(z) The cycle time variance depends indeed on the arrival process 
type. This dependence decreases, however, as the load ap- 
proaches the low or the heavy traffic regions. 

(zz) For medium loads, the cycle time variance may decrease as 
the arrival process peakedness increases. 

(uz) The approximation generally underestimates the true cycle 
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time variance. The accuracy increases with the overhead, the 
peakedness of the arrival process, and as the load approaches 
the low or heavy traffic region. 

At first sight, observation (iz) is counterintuitive and surprising 
since the mean waiting time generally increases with the arrival 
process peakedness (see, for example, Fig. 9). However, regular arrival 
patterns may result in very short and very long cycles since many idle 
cycles could be produced after a service until the next arrival occurs. 


NEW THEORY 
m—— —— HASHIDA/OHARA 
> SIMULATION RESULT WITH 


95% CONFIDENCE LEVELS 


MEAN WAITING TIME (w)) 





0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 
SERVER UTILIZATION (po) 


Fig. 4—Accuracy of mean waiting times for cyclic queuing systems M/D/1 


Parameters: g = 2 symmetrical queues 
h, = hz = 1 average service times 
Co = 0.2 and 1.0, constant overhead. 
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Fig. 5—Accuracy of mean waiting times for cyclic queuing systems M/H»/1 


Parameters: g = 2 symmetrical queues 
h, = h,. = 1 average waiting times 
Co = 0.2 and 1.0, constant overhead 
cy = 2.0 coefficient of variation of service times. 


On the contrary, batch arrivals may stabilize the cycle time since many 
cycles consist of one service time and the overhead only. Although 
these characteristics depend largely on the parameter combination, 
they indicate some interesting effects which may be important for 
applications and theory as well. 


6.2 Mean waiting time 


Since the mean waiting times in (26) are basically dependent on the 
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first two moments of the conditional cycle times, we can expect the 
same accuracy trends as for the cycle time variance. Figures 4 and 5 
show results for systems of the type M/D/1 and M/H:2/1 with two 
symmetrical queues for low and high overhead. The accuracy for M/ 
D/1 is excellent, whereas for M/H2/1 and low overhead the mean 
waiting time is underestimated. In any case, the new approach yields 
a better accuracy compared to Ref. 13, which results from the condi- 
tional cycle time concept. 
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Fig. 6—Accuracy of mean waiting times for cyclic queuing systems M'“!/D/1 


Parameters: g = 10 symmetrical queues 
h; = 1 average service time, j = 1, 2, ---, 10 
Co = 1.0 and 5.0, constant overhead 


EK = 4 constant (average) batch size. 
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Fig. 7—Accuracy of mean waiting times for cyclic queuing systems M'!/H,/1 
Parameters: g = 10 symmetrical queues _ 


h; = 1 average service time, j = 1, 2, ---, 10 

Co = 1.0 and 5.0, constant overhead 

EK = 4 constant (average) batch size 

Cu = 2.0 coefficient of variation of service times. 


Figures 6 and 7 show the results for g = 10 symmetrical queues for 
systems M'“!/D/1 (Fig. 6) and M™!/H,/1 (Fig. 7), each with constant 
or geometrically distributed batch sizes, low and high overhead. All 
cases of batch arrival processes show an excellent accuracy. Many 
other validations have also shown that the accuracy is far less depend- 
ent on the parameters g, co, or G compared to single Poisson arrivals. 
This results from the fact that the cycle time analysis yields the best 
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accuracy in case of batch arrivals; also, the contribution of the batch 
arrivals to the mean waiting time w dominates the expression (39) for 
larger batch sizes. 

Another study on the influence of the service process type G and 
arrival process type GJ on the mean waiting time w in case of ordinary 
cyclic service is shown in Figs. 8 and 9 for zero, low, and high overhead. 
The M/G/1 curves with overhead are analytic results according to 
(26), whereas the GI/M/1 curves are simulation results; the results for 
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Fig. 8—Influence of service process type for cyclic queuing systems M/G/1 


Parameters: g = 10 symmetrical queues 
h; = 1 average service time, 7 = 1, 2, ---, 10 
Co = 0, 1.0, 5.0, constant overhead 


Systems M/D/1, M/M/1, M/He/1 (cu = 2.0). 
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zero overhead are exact and have been drawn from standard queuing 
tables by the author.’ The main conclusions from Figs. 8 and 9 are: 


MEAN WAITING TIME (w;) 





(t) For M/G/1 systems with ordinary cyclic service, the influence 
of the service process decreases with increasing overhead. 

(ti) For GI/M/1 systems with ordinary cyclic service, the influence 
of the arrival process does not remarkably decrease or may 
even increase with increasing overhead (see also Figs. 6 and 7 
for batch arrivals). 
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Fig. 9—Influence of arrival process type for cyclic queuing systems GI/M/1 
Parameters g = 10 symmetrical queues | 
h; = 1 average service time, 7 = 1, 2, ---, 10 
Co = 0, 1.0, 5.0, constant overhead 
Systems D/M/1, M/M/1, Ho/M/1 (ca = 2.0). 
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These properties are most important for applications since they 
show that the results are much more sensitive to arrival processes than 
to service processes and that the usual approximation of arrival proc- 
esses by Poisson processes may result in a quite dramatic error in the 
performance estimation. Therefore, future analytic studies on cyclic 
queuing systems should aim more to the generalization of arrival 
processes. 


CYCLE (1,2,3,4) 


1,2,3,4) 


MEAN WAITING TIME (wj, j 





QUEVE NUMBER 


Fig. 10—Unbalanced load performance of cyclic queuing systems M/D/1 


Parameters: g = 4 queues 
h; = 1,7 = 1, 2, 3, 4, average service times 
u; = 0.05, 7 3, 4, constant overhead 
Ai:A2tA3:A4 2:1 arrival rate ratios. 


= 1, 2, 
= 4:3 
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6.3 Equity of service for unbalanced load 


So far, we have concentrated on cases of ordinary cyclic service. The 
final study shows how cyclic priority service can be used to achieve 
equity of service in cases of unbalanced load. In this case, queue 1 is 
served up to /, times, queue 2 up to J, times, ---, queue g up to J, 
times within a cycle, so that h, i, +--+, 1, could be considered as cycle 
priorities. Figure 10 demonstrates the use of cyclic priority service in 
case of unbalanced load in a system M/D/1 with g = 4 queues and 
arrival rate ratios \i:A2:A3:\, = 4:3:2:1. In case of ordinary cyclic 
service with the cycle {1, 2, 3, 4}, the unbalanced load produces also 
unbalanced waiting times with increasing absolute load po. The dashed 
curves show the result of cyclic priority service where /;:lo:l3:l, = A1: 
A2:A3:A4 with the cycle {1, 1, 1, 1, 2, 2, 2, 3, 3, 4}. For small po, both 
schedules do not remarkably differ in performance. In case of higher 
po, the unbalanced load effects can be compensated for by a cyclic 
priority service. 


Vil. CONCLUSION 


This paper provides a new approximate analysis for cyclic queuing 
systems M'*!/G/1 with batch Poisson arrivals, general service and 
overhead times, and ordinary cyclic service. The method allows a 
relatively easy evaluation of numerical results. The accuracy of the 
method has been validated by computer simulations. In addition to 
the analysis method, a new stability criterion for systems GI/G/1 with 
general cyclic service is developed. A number of traffic studies are 
reported revealing more insight in the traffic performance of cyclic 
queuing systems. 


Vill. ACKNOWLEDGMENTS 


The author wishes to thank J. M. Holtzman and H. Zucker for many 
stimulating discussions. The constructive comments by two reviewers 
are greatly appreciated. 


REFERENCES 


1. B. Avi-Itzhak, W. L. Maxwell, and L. W. Miller, “Queuing With Alternating Priori- 
ties,” Opns. Res., 13 (1965), pp. 306-318. | 
2. L. Takacs, “Two Queues Attended by a Single Server,” Opns. Res., 16 (1968), pp. 
639-650. 
3. R. B. Cooper and G. Murray, “Queues Served in Cyclic Order,” B.S.T.J., 48, No. 3 
(March 1969), pp. 675-689. 
4. R. B. Cooper, “Queues Served in Cyclic Order: Waiting Times,” B.S.T.J., 49, No. 3 
(March 1970), pp. 399-413. 
J. S. Sykes, “Simplified Analysis of an Alternating-Priority Queuing Model with 
Setup Times,” Opns. Res., 18 (1970), pp. 1182-1192. 
. M. Hisenberg, “Two Queues with Changeover Times” Opns. Res., 19 (1971), pp. 386- 
401. 
. M. Eisenberg, “Queues with Periodic Service and Changeover Time,” Opns. Res., 20 
(1972), pp. 440-451. 


I DO A 


MULTIQUEVE SYSTEMS 697 


8. O. Hashida, “Analysis of Multiqueue,” Review of the Electr. Comm. Laboratories, 
Nippon Telegraph and Telephone Public Corp., 20 (1972), pp. 189-199. 

9. T. S. Brodetskaya, “Capacity-Time Analysis of a Cyclic-Service Queuing System.” 
Kibernetica, No. 2 (1975), pp. 64-68 (in Russian). Translation: New York: Plenum, 
1976. 

10. S. S. Nair, “Alternating Priority Queues with Non-Zero Switch Rule,’ Comp. and 
Opns. Res., 3 (1976), pp. 337-346. 

11. M. Eisenberg, ‘““Two Queues With Alternating Service,” unpublished work. 

12. M. A. Leibowitz, “An Approximate Method for Treating a Class of Multiqueue 
Problems,” IBM Journal, 5 (1961), pp. 204-209. 

13. O. Hashida and K. Ohara,, “Line Accomodation Capacity of a Communication 
Control Unit,” Review of the Electr. Comm. Laboratories, Nippon Telegraph and 
Telephone Public Corporation, 20 (1972), pp. 231-239. 

14. S. Ekberg, “Strategies for a Telephone Central Marker Handling Different Queuing 
Jobs,” Proceedings TIMS XX, Tel Aviv (1973), pp. 523-528. 

15. P. Kuehn and M. Langenbach-Belz, “Effectiveness of Device Servicing Strategies in 
Real Time Systems,” Fourth Annual Conference of the German Informatics 
Society, Berlin, October 9-12, 1974, Springer-Verlag, 1975, pp. 463-472. Bell 
Laboratories Translation Series No. TR 77-72. 

16. S. Halfin, ‘““An Approximate Method for Calculating Delays for a Family of Cyclic- 
Type Queues,” B.S.T.J., 54, No. 10 (December 1975), pp. 1733-1754. 

17. R. B. Cooper, Introduction to Queuing Theory, New York: MacMillan, 1972. 

18. D. Jagerman, “An Inversion Technique for the Laplace Transform with Application 
to Approximation,” B.S.T.J., 57, No. 3 (March 1978), pp. 669-710. 

19. P. J. Kuehn, “Tables on Delay Systems,” Institute of Switching and Data Technics, 
University of Stuttgart, 1976. 

20. P. J Burke, “Delays in Single-Server Queues with Batch Input,” Opns. Res., 23 
(1975), pp. 830-833. 


698 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


Copyright © 1979 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 58, No. 3, March 1979 
Printed in U.S.A. 


A Comparison of the Performance of Four 
Low-Bit-Rate Speech Waveform Coders 


By J. M. TRIBOLET, P. NOLL, B. J. MCDERMOTT, 
and R. E. CROCHIERE 


(Manuscript received May 23, 1978) 


Subjective ratings were obtained for four different speech waveform 
coders of varying complexity at each of three bit rates (24, 16, and 9.6 
kb/s). ATC (adaptive transform coding) is rated the highest and 
ADPCM -F (adaptive differential PcM with a fixed predictor) the lowest, 
regardless of bit rate. Although there is large variability in the 
ratings due to different talkers and listeners, SBC (sub-band coding) 
is rated higher than ADPCM-F and about equal to ADPCM-V (adaptive 
differential pcm with a variable predictor) for most talkers and 
listeners. A weighted combination of two objective measures, one 
accounting for noise and the other for bandwidth effects, appears 
promising as a predictor of the subjective quality ratings. 


I. INTRODUCTION 


The quality of the reproduced speech from waveform coders can 
usually be improved by increasing their complexity. However, increas- 
ing the complexity also usually increases the cost. The practical 
problem in choosing a coder, at the current state of the art, becomes 
one of knowing how much loss in quality is sacrificed when opting for 
a less complex (less expensive) coder. 

This paper compares subjective quality ratings for four different 
speech waveform coder algorithms of varying complexity. The algo- 
rithms, rated in order of their complexity are: ADPCM-F (adaptive 
differential pulse code modulation with a fixed predictor), sBc (sub- 
band coding), ADPCM-v (adaptive differential PCM with a variable 
predictor), and ATc (adaptive transform coding). Each of these four 
algorithms was studied at three different transmission rates: 24, 16, 
and 9.6 kb/s. These coders were chosen because they represent a 
number of different classes of coding techniques ranging from relatively 
simple (inexpensive) to highly complex (costly) schemes. The choice 
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was also influenced by the availability of coder software at the time of 
the experiment. All coders are “non-pitch-predicting.” 

Since the coders being studied produce a broad range of qualities 
and types of degradations, the data also provided an opportunity to 
evaluate the relative merits of several objective measures that have 
been proposed for predicting quality ratings.’ 


li. THE WAVEFORM CODERS: A BRIEF DESCRIPTION 


The four waveform coders used in this experiment are depicted in 
Figs. 1 through 4 and are briefly described below. 


2.1 ADPCM with a fixed predictor (ADPCM-F) 


The ADPCM-F coder is the simplest of the four coding techniques. As 
seen in Fig. 1, it consists of a quantizer with an adaptive step-size and 
a first-order fixed predictor. The step-size adaptation is based on the 
one-word memory approach of Jayant, Flanagan, and Cummiskey.’ 
The number of bits used in the quantizers are 3, 2, and 1 bit, respec- 
tively, corresponding to the transmission rates of 24, 16, and 9.6 kb/s. 
In the case of the 9.6-kb/s transmission rate, the coder reduces to that 
of an adaptive delta modulator with a sampling rate of 9.6 kHz. The 
output of the 9.6-kb/s coder was filtered with a 0 to 2800-Hz lowpass 
filter to remove the high frequency noise. The parameters used for the 
ADPCM-F coders are close to those proposed by Jayant.° 


2.2 Sub-band coding (SBC) 


In the sub-band coder, the speech band is partitioned into sub- 
bands.*” Each sub-band is effectively lowpass-translated and sampled 
at its Nyquist rate. It is then preferentially encoded using APCM 
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Fig. 1—Block diagram of ADPCM-F. 
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(adaptive step-size PCM) encoding with a backward step-size adaptation 
algorithm. The number of bits per sample in each band is chosen 
according to perceptual criteria for that band. On reconstruction, the 
sub-bands are decoded and bandpass-translated back to their original 
bands. They are then summed to give a replica of the original speech 
signal. 

Figure 2a shows an implementation of the sub-band coder based on 
an integer-band sampling technique.* The speech band is partitioned 
into N sub-bands by bandpass filters BP; to BPy. Typically, four to 
five bands are used and, at 9.6 kb/s, gaps are permitted between the 
bands to conserve bandwidth and, therefore, bit rate, as is illustrated 
in Fig. 2b. 

The complexity of the sub-band coder is somewhat greater than that 
of the ADPCM-F coder. Using recent ccpD (charge-coupled device) tech- 
nology, the filters can potentially be implemented very efficiently. A 
single APCM coder can be multiplexed between the N sub-bands. The 
multiplexer requires digital logic and a ROM (read only memory) for 
storing the multiplexing pattern. 

The coder configurations at 9.6 and 16 kb/s are those of examples A 
and D in Ref. 5. The 24-kb/s coder uses the same filters as the 
16-kb/s coder, but the number of bits per sample per sub-band is 
increased by one. 


2.3 ADPCM with a variable predictor (ADPCM-V) 


The ADPCM-V coder is a more sophisticated version of ADPCM,°” as 
seen in Fig. 3. The input data are first buffered and delayed. From this 
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Fig. 2—(a) Block diagram of sBc. (b) Sub-band partitioning. 
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Fig. 3—Block diagram of ADPCM-V. 


buffered block of speech, a short-time estimate of the variance of the 
input speech is computed and used to control the step-size of the 
quantizer. This local variance estimate is also quantized and transmit- 
ted for use in the receiver. 

The predictor is an eighth-order adaptive predictor (no pitch pre- 
diction is involved in this scheme). The coefficients of the predictor 
are computed according to the relation 


hy = Ryn, YN, (1) 


where Ry and rw are the matrix and vector of autocorrelation coeffi- 
cients of the data in the buffer. The predictor coefficients are also 
quantized and transmitted to the receiver as side information. The 
total transmission of side information requires about 2 kb/s of data. 
In the implementation of the ADPCM-V coder, 3, 2, and 1 bits/sample 
were used for the respective transmission rates. A sampling rate of 
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8 kHz was used. Since approximately 1.5 to 2 kb/s of additional 
information are required to be transmitted with this scheme, the actual 
transmission rates represented by this coder were 26, 18, and 9.6 kb/s 
instead of 24, 16, and 9.6 kb/s, which was used in the other coders. 

The complexity of the ADPCM-V coder is primarily dominated by the 
implementation of the adaptive predictor and the computation of the 
predictor coefficients. As seen in eq. (1), this involves an autocorrela- 
tion computation and the solution of 8 simultaneous linear equations 
every 8 to 16 ms. This must be done using high-speed digital compu- 
tation. Therefore, the complexity of the ApPcM-V coder is substantially 
greater than that of the ADPCM-V or sub-band coders. 


2.4 Adaptive transform coding (ATC) 


The adaptive transform coder is analogous in some respects to the 
sub-band coder in that it divides the speech band into a number of 
frequency components.® The resolution (number of bands), however, 
is generally much finer than that used in the sub-band coder, and the 
translation to the frequency domain is achieved by means of a fast 
transform algorithm. The transform is a 128-point, discrete, cosine 
transform. The transformed coefficients are encoded with APCM en- 
coding. 

Figure 4 is a block diagram of the transform coder. The input speech 
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is buffered and transformed in blocks of data. The output values of the 
transform are smoothed and decimated to 16 values. These 16 values 
are quantized and sent as side information to the coder and the 
decoder, where they are interpolated, yielding a smoother version of 
the short-time spectrum. This smoothed spectrum forms the basis for 
a dynamic bit assignment and step-size adaptation of the quantizers, 
as a function of frequency. The quantizers are then used to encode the 
transform coefficients. 

In the receiver, a similar bit assignment and step-size computation 
is performed and the transform coefficients are decoded. The decoded 
values are then inverse-transformed to give a replica of the input 
speech. 

The degree of complexity of the transform coder is of approximately 
the same order as the ADPCM-V coder. The side information and bit 
assignment computation require high-speed digital computations. 
Some efficiency might be gained using ccpD technology to implement 
the transform. 


ill. SUBJECTIVE EVALUATION 
3.1 Experimental procedure 


Digital recordings of sentences spoken by four talkers (two male and 
two female) were processed by each of the 12 coders. The processed 
utterances were equalized to the same mean power to eliminate loud- 
ness differences. Two analog test tapes were prepared that contained 
different permutations of four random orderings of the 12 coders. The 
talkers were assigned in a balanced design so that each coder was 
represented by the speech of a different talker in each of the random 
orders. Since each of the four talkers had recorded a unique set of 
eight sentences, the sentences were randomly assigned and none 
occurred more than twice. 

Students from the junior and senior classes of local high schools 
served as paid subjects. They listened to the processed speech bin- 
aurally over Pioneer SE700 earphones while seated in a double-walled 
sound booth. Sixty-five subjects judged the 48 coded sentences (4 
coders X 3 bit rates X 4 talkers). They were asked to rate the quality 
of each sentence on a scale from 1 to 9, using a 1 to represent the worst 
quality, 9 to represent the best quality, and the numbers between 1 
and 9 for intermediate evaluations. Before the test session began, they 
judged six representative conditions for practice to familiarize them 
with the task and the range of quality. 


3.2 Results 


An initial analysis of the ratings by the 65 listeners revealed a large 
amount of unexpected variability in the data due to the different 
talkers and listeners. 
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The variability due to the different listeners is illustrated in Fig. 5. 
These histograms show the percent of the listeners who assigned each 
of the nine possible ratings to each of the 12 coders. While these values 
were computed by summing across the four talkers, comparable plots 
for the individual talkers produced essentially the same general pat- 
tern, indicating that the variability in the ratings cannot be attributed 
entirely to the effects of the different talkers. The extremely skewed 
distributions for ATc at 24 kb/s and ADPCM-F at 9.6 kb/s show strong 
agreement among the listeners about which coders had the best and 
worst qualities, but the wide range, and in some cases almost flat 
character, of other distributions indicate that the listeners differ in the 
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Fig. 5—Percent of ratings assigned each coder (65 listeners X 4 talkers). 
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amount and type of distortion that they will tolerate. However, it is 
not the purpose of this paper to examine the sources of listener 
variability but to gain some information about general trends and the 
trade-off between complexity and quality. Since so many of the distri- 
butions showed a lack of unimodality, the median ratings were used to 
compare the evaluations according to the experimental variables: type 
of coder, bit rate, and talkers. 

The variability in the ratings due to the different talkers is shown in 
Fig. 6, where the median rating (bracketed by their 0.95 confidence 
interval) of the 65 listeners for the 12 coders are plotted for each of the 
four talkers. The overall pattern for all talkers shows that aTc had the 
highest rated quality, ADPCM-F the lowest rated quality, and the rating 
for each talker-coder combination drops as the bit rate is reduced. The 
ratings for SBC and ADPCM-V are about the same for all voices except 
female 2, indicating that they have about the same overall quality. For 
the same three voices, ATC at 16 kb/s is rated equal to, or better than, 
the other three coders at 24 kb/s. 

Some specific differences in ratings are consistent with the usual 
male-female voice distinctions, and some are speaker idiosyncratic. 
ATC at 16 kb/s is rated almost equal to 24 kb/s for male voices but 
much lower for female voices. ADPCM-v at 16 kb/s is rated almost 
equal to 24 kb/s for female 1 but almost three categories lower for 
female 2. The coders span a much wider range of ratings at the lower 
bit rates for male 2 than for male 1. The separation of the four coders 
into only two classes for female voice 2 is a unique result in this small 
sample of four voices, but an analysis of the characteristics of her voice 
could provide important information about the effect of different 
voices in digital coding techniques. 


IV. OBJECTIVE MEASURES 


Several objective measures that may be more sensitive to the types 
of degradations produced by waveform coders are described in Ref. 1. 
The data from this experiment provided an opportunity to test these 
theoretically derived predictors of quality ratings. The median ratings 
(bracketed by the 0.95 confidence interval) computed across both 
listeners and talkers are plotted in Fig. 7 and served as a single basis 
for comparing the efficacy of these various objective measures. 


4.1 Segmental S/N ratio 


Perhaps the most widely used measure of performance has been the 
conventional signal-to-noise ratio, although it has not generally been 
found to be a very good indicator of subjective quality. An improved 
measure, proposed by Noll,’” averages the s/n ratio in short (20 to 30 
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Fig. 6—Median rating of 12 coders for each talker. 


ms) segments and has been found to be a more accurate predictor. In 
Fig. 8, the segmental s/n ratio, averaged across talkers, is plotted for 
each coder as a function of bit rate. These plots show that segmental 
s/n ratio is linearly related to the bit rate for each type of coder except 
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Fig. 7—Median rating of 12 coders (65 listeners X 4 talkers). 
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Fig. 8—Signal-to-noise ratio measured segmentally (15-20 ms) as a function of 
transmission rate for 4 types of coders. 


SBC at low bit rates. While it orders ATC and the two ADPCM coders 
relative to each other at each bit rate, it underestimates the quality 
ratings of SBC shown in Fig. 7. In an effort to improve the effectiveness 
of s/n ratio as an indicator of quality, a number of frequency-weighted 
s/n ratio measures were computed, but they also were not highly 
successful in predicting the relative ordering of these four coders.’ 


4.2 Noise-to-signal measure 


Another functional form for measuring the noise-to-signal power 
ratio in coders has been recently proposed’ and analyzed.” This 
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measure was derived through concepts of log likelihood ratios and has 
the following functional form: 


B 
be=10 logo | 14 > ei/s3 |, (2) 
j=l 
where the summation is taken over B = 16 frequency bands spaced 
according to the articulation bands in the range of 200-3200 Hz."'” The 
bar above the equation denotes that /,, is the result of an average over 
(20 to 30 ms) segmental measurements, where e; is the segmental 
noise power and s; is segmental signal power in band /. 

The values of /,,, averaged over talkers, are plotted as a function of 
bit rate in Fig. 9a. This measure is a better predictor of the relative 


ATC 
ge 
ADPCM-V 
ADPCM —F 






20 


25 


PERCENT ARTICULATION BANDWIDTH 





8 9.6 12 16 20 24 28 
TRANSMISSION RATE IN KILOBITS PER SECOND 


Fig. 9—(a) Noise-to-signal measure averaged over 16 articulation bands. (b) Percent 
of articulation bandwidth. 
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ordering of the coders at 24 and 16 kb/s where the distortions are 
primarily due to noise effects. At 9.6 kb/s, additional distortions due 
to bandwidth limitations are introduced by some of the coders. These 
distortions are not accounted for by the /,, measure and do not predict 
the subjective ratings at this bit rate. 


4.3 Percent bandwidth 


A bandwidth measure was defined’’® to account for the loss of 
bandwidth by the coders at low bit rates. The measure is defined as a 
simple percentage of the bandwidth on an articulation scale. Thus, the 
loss of a fixed bandwidth at a lower frequency has more importance 
than at a higher frequency. A coder with a flat frequency response 
from 200 to 3200 Hz is defined as one with a 100-percent articulatory 
bandwidth. 

The bandwidth is measured on a segmental (32 ms) basis, and a 
smoothed spectral estimate of the input and output of the coder is 
computed for each segment. If, at any frequency point, the output 
power is less than, say, 10 dB of the input power, it is counted as a loss 
of bandwidth and is weighted according to the articulation scale. An 
average bandwidth is then computed over the entire utterance. 

The percent bandwidth, plotted in Fig. 9b, complements the J, 
measure in that it is sensitive to differences at 9.6 kb/s and relatively 
insensitive to differences at 16 and 24 kb/s. This measure correctly 
indicates that sBc at 9.6 kb/s has the lowest bandwidth due to the 
spectral gaps depicted in Fig. 2b. It also reflects the lower bandwidth 
of aTc at 9.6 kb/s due to the dynamic bit allocation in the algorithm 
and the 0 to 2800 Hz low-pass filter for ADPCM-F at this bit rate. 


4.4 A combined measure 


By forming a linear combination of the J, and B,, (bandwidth) 
measures, a combined measure, Q, which accounts for both noise and 
bandwidth effects, can be defined. This measure has the form 


Q =A; + Ao ln + As Bu, (3) 


where A, As, and A; are computed by multiple regression techniques. 
The results of this combined measure for A; = —3.78, Az = —0.42, and 
A; = 0.15 are shown as the abscissa in Fig. 10, and the median ratings 
(of the 12 coder-bit rate combinations) as the ordinate. The solid dots 
in the scatterplot are the median ratings of the listeners summed over 
talkers and are the values used to compute the weighting coefficients. 
The four A’s around each point are the median ratings for each of the 
four talkers. The high correlation (0.98) shows that this combined 
measure is a good predictor of the median ratings but will seriously err 
in predicting the ratings of some specific talker-coder combinations, 
particularly at low bit rates. 
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Fig. 10—Prediction of median rating by weighted combination of noise-to-signal and 
percent bandwidth measures. 


V. DISCUSSION 

Although the large variability in the ratings due to the different 
talkers and listeners rendered the quality ratings less precise than 
anticipated, an underlying pattern of relationships can be detected. 
ATC has the highest rating and ADPcM-F the lowest rating, regardless 
of bit rate, for most talkers and listeners. If cost/complexity is of no 
concern, then ATc is the most attractive of these coders. If cost/ 
complexity is of concern, then SBC is an attractive choice since it is 
only slightly more complex than the ADPCM-F coder and about equal 
in quality to the costlier ADPCM-V coder. 

The combination of two objective measures, /J,, and B,, each ac- 
counting for a different type of degradation, appears to be promising 
as a predictor of subjective ratings. Its precision is obscured by the 
interactive effects of the talkers and listeners. Of these two effects, the 
talker interaction is probably the easier to eliminate. The character- 
istics of different voices can be studied, and possibly the factors 
affecting the different coders could eventually be identified. However, 
the listener variability is a more difficult problem. Digital coding 
techniques are producing a variety of new and different types of 
degradations. As the bit rate is reduced, ATC produces a burble, sBc 
produces a reverberant quality, and the ADPCM coders produce a 
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’ signal-dependent noise. The variability in the subjects’ ratings indi- 
cates that the trade-off for these different types of degradations is not 
the same for all people. 
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LED Array Package for Optical Data Links 


By A. ALBANESE and W. S. HOLDEN 
(Manuscript received September 8, 1978) 


A package with an array of six LEDs is described. The design 
represents a first effort at reducing the cost and the size of electro- 
optical interfaces for applications in low-bit-rate transmission sys- 
tems and computer data links. A package containing six GaAs LEDs, 
electrical connections, and a new fiber ribbon connector was fabri- 
cated, mounted, and encapsulated using presently available technol- 
ogy. Each Lep in the array, with 80 mA dc applied, can couple an 
average optical power of 5 .W into a graded-index fiber having 0.22 
N.A. and 55-um core diameter. Cross-modulation between adjacent 
LED signals is smaller than —50 dB. Fiber ribbon connectors fabri- 
cated with matching plates which align the fibers with the light 
output from the LEDs showed insertion losses as low as 0.4 dB. 


1. INTRODUCTION 


Optical fiber technology is of potential use in data links for com- 
puters and switching systems where optical fibers can replace existing 
copper wires.’ In most of these applications, the space and the cost of 
the electro-optical interfaces can be decreased, as in the case of 
integrated circuits (Ics), by using a batch fabrication process and by 
having several components share the same package. A further reduc- 
tion in cost and space would result from the integration of several 
optical devices and several Ics into a single package. This integration 
poses a new challenge for Ic packaging where this package must 
include electronic components, heat sink, electrical connections, elec- 
tro-optical interfaces, and optical connections. 

In the present work, a package with an array of six LEDs is presented 
as an initial effort toward the integration of several electro-optical 
interfaces that share a single package as a way of reducing space and 
fabrication cost. 

Figure 1 is a drawing of the experimental LED array package fabri- 
cated using standard dual-in-line packaging (pip) techniques. The 
package has an array of six homojunction GaAs LEDs of the Burrus 
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Fig. 1—LED array package. 


type,” a thin-film fan-out to provide electrical connections, a screw- 
mountable copper heat-sink that supports the package and also serves 
as a ground terminal, and a new fiber ribbon connector that couples 
the light from the LEDs into an array of six fibers. 

The LEDs described in this paper were fabricated on 625-um centers 
with 60-m-diameter emitting areas; however, arrays have also been 
made with 375-um centers. Each diode in an array produced ~3-mW 
total output power into air at 300-mA driving current. The variation 
in output from diode to diode in a given array is +0.25 dB. The light 
output is linear in the current range of ~25 mA to ~250 mA. A 
component spacing of 625 jm was chosen for the LEDs in the trans- 
mitter array, the photodetectors in the receiver array, and the fibers 
in the fiber cable. Having a standard spacing will decrease the design 
and production costs of such a package and simplify the optical 
connection between the package and the fiber cable. 

The optimum number of LEDs in an array will depend on particular 
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system requirements, fabrication yield of the array package, and the 
reliability of the LEDs that form the array. A design consisting of six 
LEDs per package was selected to prove the feasibility of packaging an 
array of LEDs, to explore its performance, and to determine some 
fabrication problems that may limit the density of devices in a package. 

To facilitate the description, the following discussion of the package 
design deals separately with the LED array, the DIP part, the optical 
coupler, and the fiber ribbon connector (see Fig. 1). 


ll. LED ARRAY 


Figure 2 shows an array of six GaAs LEDs of the Burrus type 
fabricated using a batch process technique. A plasma-deposited silicon 
nitride film, 750 A thick, is employed as the Zn-diffusion mask in 





Fig. 2—LED array. 
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forming the planar p—n junctions. The planar-diffused junction ap- 
proach is ideal for lateral current confinement in the light-emitting 
area and thus minimizes the cross modulation between adjacent LEDs. 

Burrus LEDs were selected because they were available in the 
laboratory, but arrays with other types of LEDs or lasers may be 
implemented.*” However, the chemically etched “well” in the Burrus 
LEDs may facilitate the alignment of the fibers provided the emitting 
areas of the LEDs are centered within the “wells.” 

The LEDs have a 40-MHz bandwidth (3-dB point), and the inter- 
modulation between the light intensity of two fibers placed in front of 
two adjacent LEDs is lower than —50 dB. 


lil, DUAL-IN-LINE PACKAGE (DIP) COMPONENTS 


The LED array shown in Fig. 3 was mounted using standard Ic 
technology. The stud of silver supporting the LED array is indium 
soldered to a copper plate that serves as the heat sink and as the 
electrical connection for all the anodes of the LEDs. Gold-plated silver 





Fig. 3—LED array after being mounted on the copper plate. 
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tabs connect the cathode of every LED to a thin-film fan-out mounted 
on a ceramic board with copper leads. There is space in the package 
for several Ics that can serve as drive circuits for the LEDs. 


lV. OPTICAL COUPLER 


The optical coupler, which consists of six fiber stubs, serves the 
following two functions: first, it provides a protective housing for the 
LEDs and any associated circuitry, and second, it provides a means of 
light guidance from the LEDs to the fiber ribbon connector. 

Several optical couplers have been implemented to improve the 
coupling efficiency between LEDs and fibers. Selfoc lenses,’ glass 
spheres,’ and tapers are all possible solutions. For an initial trial, short 
pieces (5-mm long) of graded-index fiber, similar to those in the fiber 
cable, were used because they were readily available. The character- 
istics of the fiber are: 0.22 N.A., 55-u4m core diameter, and 110-yum 
overall fiber diameter. | 

Before inserting the LED array into the mold for the casting opera- 
tion, the short pieces of fiber were mounted on the LEDs according to 
the following steps. The LED array is placed on a holder, and a current 
of 50 mA is applied to each LED. A 2-meter fiber with flat ends couples 
the light from one of the LEDs in the array to a power meter. The 
position of the fiber in front of the LED is adjusted until a maximum 
reading of the power meter is obtained. Then the fiber end is glued to 
the LED using a photopolymer (Norland Optical Adhesive No. 61). The 
fiber is subsequently fractured with a diamond scriber to yield a 5-mm 
length. This process is repeated for each LED in the array. 


V. FIBER RIBBON CONNECTOR 

A new demountable connector was designed to join a fiber ribbon to 
the package. The connector utilizes a metal plate with eight holes. 
Two large holes, 1.574 mm in diameter, serve as the guides for two 
stainless steel pins which align the two parts of the connector; six small 
holes, 114 um in diameter, align the fibers on 625-ym centers. The 
position accuracy of the fiber must be better than +2.5 um to achieve 
an insertion loss objective for the optical connection of less than 
0.5 dB.° 

The first prototypes of the connector were made of stainless steel by 
machining the plates in pairs. The insertion loss measured on connec- 
tions made of matched plates was 0.4 dB. However, insertion losses as 
high as 5 dB were measured on connections with plates of different 
pairs. 


VI. CASTING 


Figure 4 shows the mold used to cast the package of the LED array. 
The mold is made of brass and the movable parts are made of stainless 
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steel. The metal plate with two stainless steel pins and six small holes 
aligns the fibers 625 wm apart. The post inside the mold serves to 
position a nut before casting. The two side plugs cover the leads of the 
array during casting. 

The LED array with the six short pieces of glass fiber attached is 
placed into the mold, such that the six fibers enter into the six 
alignment holes of the stainless steel plate. Finally, the mold is covered 
and the casting material (epoxy, Bacon compound 84 plus an activator 
BA-63) is poured into the mold at room temperature. 

After a curing time of 24 hours, the LED array package is removed 
from the mold. The metal plate with two stainless steel pins and the 





Fig. 4—Casting mold to encapsulate the LED array. 
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Fig. 5—LED array package and cable termination. 


nut, inside the mold, becomes part of the package. The fibers sticking 
out of the connector surface are shortened using a diamond scriber. 
The two stainless steel pins are momentarily removed, and the con- 
nector surface is lapped and polished. 

The cable termination of the ribbon connector was cast in a way 
similar to the LED array. The six fibers of the cable were threaded into 
the holes of a metal plate, and the connector was cast using the same 
mold as in the case of the LED package. Figure 5 shows the finished 
LED array package and the cable termination. 


Vil. CONCLUSIONS 


A package with an array of six LEDs has been described. The design 
represents a first effort at reducing the cost and the size of electro- 
optical interfaces for applications in low-bit-rate transmission systems 
and computer data links. A package containing six GaAs LEDs, electri- 
cal connections, and a new fiber ribbon connector was fabricated, 
mounted, and encapsulated using presently available technology. 

The LEDs in the array, with 80-mA dc applied, couple an average 
optical power of 5 nW into a graded-index fiber with 0.22 N.A. and 55- 
um core diameter. Cross-modulation between adjacent LED signals is 
smaller than —50 dB. Fiber ribbon connectors fabricated with matching 
plates which align the fibers with the light output from the LEDs 
showed insertion losses as low as 0.4 dB. 
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DPCM with Forced Updating and Partial 
Correction of Transmission Errors 


By R. STEELE, D. J. GOODMAN, and C. A. MC@ONEGAL 
(Manuscript received September 28, 1978) 


A speech signal is dpcm (differential pulse code modulation) en- 
coded at the sampling rate f, and also pcm encoded at a rate f,/W. 
Blocks of W dpcm words and one pcm word are transmitted. The 
receiver compares the decoded dpcm signal at the end of every block 
with the decoded pcm sample. If the difference is above a threshold, 
it is assumed that one error exists within the block, and a search ts 
made for the erroneous dpcm code word. Correction is accomplished 
by inverting the bits in this code word until the difference is below 
the threshold. Whether or not the error ts corrected, the dpcm signal 
at the end of the block is forced to the value of the pcm sample, 
thereby preventing error propagation outside the block. The system 
improves segmental s/n ratio by 7 dB for W = 64 and bit error rates 
between 0.1 and 0.5 percent. Larger improvements are available with 
smaller block sizes. In the absence of transmission errors, there is no 
perceptible distortion due to the correction system. 


I. INTRODUCTION 


We describe a new method of protecting dpcm (differential pulse 
code modulation) speech signals against the effects of transmission 
errors. The dpcm bit stream is divided into blocks, and one pcm sample 
is transmitted with each block. At the receiver, the appropriate sample 
of the integrated dpcm signal is compared with the pcm sample. A 
disparity between the two samples is evidence of a transmission error 
within the block. 

When an error is thus detected, the dpcm integrator is reset to the 
value of the pcm sample and an algorithm is invoked to locate the 
error within the block. When the algorithm is successful, the transmis- 
sion error is completely corrected. Even when the algorithm is unsuc- 
cessful, the resetting of the integrator at the receiver prevents the 
error from propagating outside the block in which it occurs. 
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This approach to error protection is different in spirit from conven- 
tional channel coding aimed at protecting a digital information stream 
regardless of its nature. Our method is directly keyed to the dpcm 
character of the message. Because it introduces its own redundancy to 
a dpcm signal, it is more powerful than the ppc (difference detection 
and correction) system described by the authors.'” ppc is implemented 
at the receiver only and infers transmission errors from anomalies 
within the integrated dpcm sample sequence. 

Other authors have reported on the periodic transmission of pcm 
code words in a dpcm picture coding system.® The pcm samples were 
used to update the receiver integrator and thereby curtail visible 
streaks caused by dpcm transmission errors. The use of pcm samples 
for error detection and correction is new to this paper. 


ll. SYSTEM DEFINITION 
2.1 The transmitter 


The transmitter (Fig. 1) sends one block of data every W sampling 
intervals. The data consist of the W code words of a conventional 
dpcm encoder plus one pcm code word formed by quantizing the input 
sample at the end of the block. In our implementation, the pcm and 
dpcm samples are formed by the same quantizer. As a consequence, 
for intermediate and high level inputs, the pcm signal is unable to code 
the entire range of signal amplitudes. Therefore, at the receiver, the 
error control mechanism is disabled when the pcm quantizer is over- 
loaded so that errors affecting high amplitude speech samples go 
uncorrected and are allowed to propagate beyond the blocks in which 
they occur. We accept this penalty in order to derive the convenience 
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Fig. 1—Transmitter. The dpcm and pcm bit streams are multiplexed to form the 
transmitted sequence. The symbol rate is f,(W + 1)/W Hz. 
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Fig. 2—Receiver. If, at the end of the block, the pcm sample and the dpcm sample 
differ by more than A, the correction logic is introduced. The switch in the integrator is 
in position 1 for W samples and goes to position 2 at the end of each block provided x 
is not an extreme value. 


of a system with one quantizer, because high amplitude samples occur 
with relatively low probability in speech, and because we anticipate 
that errors in the idle channel and low amplitude regions of the signal 
are the most damaging ones subjectively. 


2.2 The receiver 


During reception of the nth block of data (Fig. 2), the integrator 
generates W dpcm output words ynw+,;, from the received dpcm 
sequence {g;,} according to: 


Vnwsj = AYnw+y—1 a InW+js J = 1, 2; ee W. (1) 


These W samples are stored for possible revision by the error correc- 
tion algorithm. At the end of the block, ¥n+1,w is reset to the pcm 
sample X(n+iw if X(n+1)w 1S not the most positive or the most negative 
code word. When X(n+1)w is at an extreme value, ¥(n+1)w remains set at 
the value computed from (1). 


2.2.1 Error detection 


Provided X(n+1)w does not indicate quantizer overload, it is compared 
with yin+1)w derived from (1). If these two samples differ by more than 
A, the quantizer step size, an error in the nth block is inferred and a 
search for the error is initiated. Otherwise the samples 


VnW+) ’ YnaWw+2 gh fy Mn+1)w (2) 
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are sent to the system low-pass filter, V(in+i1)w 1S reset to X(n+1yw, and 
block n + 1 is processed. 


2.2.2 Error correction 


When | X(n41).w — Yint+1)w| > A, the system scans the samples in its 
buffer and finds the largest sample-to-sample difference in the block. 
That is, 1t computes 


5; = | Yaw — Ynwej-1|, J =1, 2. WS Sig Ww 


and determines the maximum 4,. If this maximum occurs at the rth 
position in the block, the corrector modifies q, by successively inverting 
bits in the code word L,. With each bit inversion, g-, a version of g,, is 
formed and (1) is iterated to produce a new trial value of ¥(n41)w, which 
we call ¥(n+1)w. Thus we have 


YnWer = AYnW+r-1 + gr 
and 
Yawsj = AVaw+j-1 + Qnwej,  J=Brti,r+2,---, W. 
Now, if ¥(n+1)w satisfies the test, 
| Vinsyw — Xinsyw| SA, 
the modified sequence 


Ynw+l, 7-8 ey YnW+r—l1y YnW+rs oe 8S ag Y(n+1)W 


becomes the system output. Otherwise, a new value of gq, is obtained 
by inverting another bit in L,. 

If no single-bit inversion in L, succeeds in bringing ¥(n+1) w sufficiently 
close to X(n+1)w, the correction attempt ceases and the samples in (2) 
are sent to the output filter. 

Clearly, this correction scheme is effective only when there is one 
bit in error in a block of W samples and this single error leads to a 
large sample-to-sample difference. However, even when there is more 
than one error in the block, the updating of the integrator signal 
prevents long-term propagation of error effects. 

Errors in pcm samples induce distortions that would not be present 
in ordinary dpcm. Our performance evaluations indicate that the 
effects of these distortions are substantially smaller than the benefits 
of error correction and integrator updating. 


il. EVALUATION 


The technique was evaluated by means of computer simulation of a 
7-bit, 8-kHz single-integration dpcm system with prediction coefficient 
0.9. To obtain the objective measurements displayed in Figs. 3 to 5, we 
repeatedly played a single sentence through the system: “I have two 
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SEGMENTAL s/n RATIO IN DECIBELS 





0 
—20 —10 0 10 20 
RELATIVE INPUT LEVEL IN DECIBELS 


Fig. 3—Performance as a function of input level. (a) No transmission errors. Segmen- 
tal s/n ratio reflects only quantizing noise. The curves with and without error protection 
virtually coincide. (b) Transmission error rate is 0.0014 and the system is unprotected. 
(c) Error rate is 0.0014 and protection block size is 64. 


daughters, Lorna and Susan,” spoken by a male. By listening to many 
samples of processed speech from a variety of talkers, we confirmed 
that the error protection indicated in these graphs is subjectively 
meaningful and not peculiar to a single utterance. 

For each system configuration, we measured segmental signal-to- 
noise ratio,* defined as the decibel average of the signal-to-noise ratios 
in 214 speech segments, each of duration 16 ms. These are the segments 
(out of 224 in the 3.5-second utterance) in which the rms signal level 
exceeds —60 dB relative to the peak signal. Segmental s/n ratio is 
considered a better indicator of speech quality than ordinary s/n ratio. 
In a study of adaptive dpcm, there was a correlation of 0.93 between 
segmental s/n ratio and subjective ratings of speech quality.” The 
comparable correlation with ordinary s/n ratio was only 0.69. 

Variables in our experiment were input signal level, block size W, 
and transmission error rate. The simulated channel introduces random 
errors to the serial bit stream consisting of W 7-bit dpcm code words 
and one 7-bit pcm code word per block. 

Figure 3 shows segmental s/n ratio as a function of input level for a 
block size W = 64 and for unprotected dpcm. The top curve is for zero 
error rate, in which case s/n ratio is the same (to within 0.3 dB) with 
and without the error protection mechanism. Curve b shows the effect 
on dpcm of a channel with error rate 0.0014. In the granular noise 
region (signal level below the peak of curve a), these errors cause a 
degradation of about 13 dB in s/n ratio. Over most of this region, the 
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Fig. 4—The effect of block size on performance. The relative input level is 0 dB and 
the error rate is 0.0014. Over most of the range, s/n ratio decreases by about 2 dB per 
octave increase in block size. 


protection system redeems about 9 dB of this loss. For reasons dis- 
cussed in Section 2.1, the effectiveness of the error protection dimin- 
ishes as signal level increases. 

Figure 4 pertains to the 0-dB relative input level (see Fig. 3) and 
0.0014 error rate. It shows the effect on segmental s/n ratio of varying 
the block size, W, over the range 8 to 128 samples. A small block size 
offers more protection but exacts a greater penalty in transmitted bit 
rate than a large block size. With W = 8 there is almost one extra bit 
per code word and an improvement of 11.5 dB in s/n ratio. The 
improvement decreases by about 2 dB per octave change in W over 
the range we investigated. 

The effectiveness of the error protection as a function of channel 
quality is shown in Fig. 5 for the input level 0 dB and W = 64. Our 
error protection mechanism results in an s/n ratio improvement of 
about 6 to 8 dB at error rates between 0.001 and 0.01. For comparison, 
we also show the performance of a dpcm system protected by the DDC 
(difference detection and correction) scheme.'” DDc requires no mod- 
ification of the dpcm transmitter and no additional transmitted bits. 
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Fig. 5—Performance as a function of channel error rate, 0 dB relative input level. 
Here, the block protection scheme is compared with ordinary dpcm and with dpcm 
augmented at the receiver by ppc, a difference detection and correction system. 
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TIME 


Fig. 6—Oscillograms of a segment of the test utterance. (a) Original speech, 0 dB 
relative input level. (b) dpcm output with 0.0042 error per bit. (c) After block protection 
with W = 64. (d) After selective smoothing of (c) by means of DDc. 


DPCM WITH FORCED UPDATING 727 


Its performance, however, is about 2 dB poorer than that of the new 
scheme. 


IV. REPROCESSING THE PARTIALLY CORRECTED SPEECH 


The forced updating of the integrator signal itself causes sharp 
transients in the system output when a detected error cannot be 
corrected. Thus, the output signal often contains many spurious spikes. 
This phenomenon is illustrated by the speech waveforms in Fig. 6. A 
segment of the original speech signal is shown in Fig. 6a, and Fig. 6b 
shows the same segment corrupted by transmission errors (which 
occurred with probability 0.0042). Figure 6c shows the output of the 
protected dpcm system, which has suppressed most of the channel 
error noise but left a residual spike (click) in the signal. ppc® (as 
applied to pcm) is designed to smooth out such spikes and the effect 
of reprocessing the system output with DDC is seen in Fig. 6d. In 
general, appending DDc to the forced updating method is effective in 
suppressing residual impulse noice. 


V. CONCLUSIONS 


By periodically introducing pcm samples to a dpcm signal sequence, 
it is possible to reduce substantially the propagation of transmission 
errors in dpcm. Furthermore, at the cost of some delay, storage, and 
elementary signal processing in the receiver, Many errors can be 
completely corrected. Small block sizes (frequent pcm transmissions) 
are more effective but introduce larger transmission rate penalties 
than large block sizes. 

We have shown that this block protection scheme improves segmen- 
tal s/n ratio of speech signals, and our experience of listening to several 
speech samples processed this way confirms that subjective quality is 
correspondingly enhanced. The method is also very appropriate to 
dpcm transmission of video signals in which error propagation causes 
very objectionable streaks in pictures.° 
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Equivalent Circuits for the Analysis and 
Synthesis of Switched Capacitor Networks 


By K. R. LAKER 
(Manuscript received August 17, 1978) 


An extensive library of z-domain building block equivalent circuits 
is derived to facilitate the analysis and synthesis of switched capac- 
itor (sc) networks. These sc building blocks, typically comprised of a 
single capacitor and from one to four switches, serve as basic circuit 
elements for sc networks in much the same spirit that resistors and 
capacitors serve analog networks. This building block library facilt- 
tates the derivation of canonic z-domain equivalent circuits for 
complex sc networks and the application of well-established mathe- 
matical network analysis and synthesis tools. What has been sought 
are easily applied techniques for achieving the same insights for sc 
networks that we have long enjoyed with analog networks. 


1. INTRODUCTION 


Over the past several years, many researchers''” have searched for 
the means to realize monolithic analog recursive filters, particularly 
for voice frequency applications. Initial attempts to realize a monolithic 
filters technology led to the development of active-R or resistor-only 
active filters..®° By removing the large external capacitors (C ~ 
5000 pF), such filters are, in principle, highly suited to integration with 
standard bipolar processing. The frequency dependence for these 
filters is derived'” from the single-pole rolloff due to a compensation 
capacitor (C ~ 30 pF) to achieve a unity gain frequency of 1 MHz. 
This method of operation posed two significant barriers to the practical 
application of active-R filters; namely, large resistor ratios’ are re- 
quired to reach audio frequencies and the unity gain frequencies are 
not sufficiently stable'“* for precise filter realization. Although these 
barriers have to some degree been overcome,””® it has become clear 
that the future of integrated circuits is in MOS large-scale integration 
and very large-scale integration processing. LSI has substantially re- 
duced the cost of digital logic and memory, and VLsI will bring even 
further cost reductions. 
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With the overwhelming success of digital MOS LSI and the promises 
of VLSI as motivation, much productive effort’"'” has been spent on the 
development of a compatible Mos LSI sample data technology. This. 
work culminated in the development of compact operational ampli- 
fiers'’” with acceptable noise and power specifications, charge transfer 
device (cTD) transversal filters,'* '” and switched capacitor (Sc) recur- 
sive filters’'! which fully utilize the advantages provided by MOS LSI. 
The transfer function coefficients’ '' of an sc recursive filter are 
determined by a highly stable clock frequency and capacitor ratios 
which can be held to very tight tolerances (measured"’ errors of less 
than 0.2 percent have been achieved for binary valued capacitor 
ratios). Furthermore, MOS capacitors are nearly ideal, with very low 
dissipation factors and temperature coefficients” of less than 100 ppm/ 
°C. This process of inherent precision and quality is sufficient to meet 
many filter and system specifications. 

Considering the growing interest in MOS switched capacitor net- 
works, the need is obvious for analytical and computer-aided tools'”~' 
for the analysis and synthesis of sc networks. The pioneering work'” '’ 
of Kurth and Moschytz provided a rigorous, network-theoretic foun- 
dation to the characterization of sc networks. They considered the 
analysis of sc networks comprised of capacitors and periodic, bi-phased 
switches. These networks, which are sampled data in nature, were 
shown to be characterized by nodal charge difference equations with 
periodically time-varying coefficients. This system of equations can be 
transformed into the z-domain*™” *’ to obtain the frequency response of 
the sc network. To reduce the analytical complexity, a building block 
approach was introduced” with the six basic building blocks: (7) shunt 
capacitor, (zz) series capacitor, (zz) shunt capacitor in parallel with a 
switch, (iv) series capacitor in parallel with a switch, (v) shunt switch, 
and (vi) series switch. These building blocks were expressed as four- 
port equivalent circuits with each two-port pair accounting for each of 
two signal paths which result from the two switch phases. The two 
signal paths, denoted even and odd, were shown to be linked by a 
common link two-port (LTP)'’ network. Any sc network, comprised of 
bi-phase switches,.can be transformed into a z-domain equivalent 
circuit by interconnecting the appropriate combination of these build- 
ing-block equivalent circuits. This equivalent circuit then provides the 
network designer with a pictorial representation of the circuit from 
which transfer relations can be derived between any pair of node 
voltages and provides the instant insight to circuit innovations we 
have long enjoyed with analog, linear, time-invariant networks. 

The primary objective of this paper is to extend the Kurth- 
Moschytz” library of building blocks to include those higher-order sc 
elements which occur frequently in complex sc networks. These ele- 
ments are typically comprised of one capacitor and from one to four 
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switches. With this expanded hbrary of equivalent circuits, one can 
efficiently derive canonic 2-domain equivalent circuits for any SC 
network. It is shown that, by manipulating the equivalent circuit for a 
toggle-switched four-port element one can derive all the equivalent 
circuits in the library. Alternative interpretations of the Kurth- 
Moschytz LTp are also provided which facilitate the derivation of 
canonic equivalent circuits. Finally, several examples are given which 
demonstrate the ease in which equivalent circuits are constructed and 
the insight derived therefrom. 


Il. APPLICATION OF Z-TRANSFORM TECHNIQUES TO THE ANALYSIS OF 
SC NETWORKS WITH BI-PHASE SWITCHES 


This section briefly reviews the basic assumptions’ '’ regarding the 


sampled data nature of sc networks and the fundamentals germane to 
the derivations and procedures given in the succeeding sections. This 
review also provides an opportunity to define symbols and to acquaint 
the reader with the notation employed. 


2.1 Operation of ideal SC networks 


Consider now the operation of an ideal sc network, comprised of 
ideal capacitors, ideal switches, and ideal voltage-controlled voltage 
sources (1.e., ideal operational amplifiers) when excited by sampled 
data voltage inputs. Typically, the switches are controlled by a two- 
phase, nonoverlapping clock of frequency f/f. = 1/27, as shown in Fig. 
1. Note that $° is used to denote the even clock phase that instanta- 
neously closes the e-switch on the even 2k T times. Similarly, 6" denotes 
the odd clock phase that instantaneously closes the o-switch on the 
odd (2k + 1)T times. The switches are assumed to have a 50-percent 
duty cycle with equal (7-second) on- and-off time periods. It is further 
assumed that both the input and output of the sc network are sampled 
data signals which change in value only at switching instants kT. Thus, 
in their most general form, the voltage sources and internal circuit 
voltages are assumed to be sampled at times kT and held over a one- 
half clock period interval (7) as shown in Fig. 2a. With this assumption, 
we can apply’ z-transform techniques to the general analysis and 
synthesis of sc networks. The z-transform, z = e”, where s is the 
complex analog frequency variable and 7 is the clock period, then 
provides us with a convenient means for performing frequency domain 
analysis. Of course, the z-domain transfer functions obtained from this 
procedure relate the input and output samples of switched capacitor 
networks. ‘Thus, to obtain the response, the input must also be char- 
acterized in the z-domain. Furthermore, if the output is considered as 
a held’ (staircase) signal, this computed response must be modified”! 
by (sin x)/x. When continuous inputs are applied directly” ~' to sc 
networks, the analysis can become considerably more complex. 
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Fig. 1—(a) Simple switched capacitor network with (b) two-phase nonoverlapping 
clock. 
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Fig. 2—(a) Sampled data voltage waveform portioned into its (b) even and (c) odd 
components. 
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As pointed out in Ref. 16, the switching action described in Fig. 1 
provides a time-varying nature to the sc network. That is, as the 
switches open and close, the network graph changes, alternating be- 
tween two topologies. One topology corresponds to the even clock 
phase and a second topology to the odd clock phase. Thus, one can 
view the time-varying sc network, with bi-phase switches, as two 
interrelated time invariant networks.'°”! In lieu of this fundamental 
approach, it is mathematically convenient to partition'® the sampled 
data voltage waveform in Fig. 2a into its even and odd components as 
shown, respectively, in Figs. 2b and 2c. Comparing uv" and v® to the 
clock waveforms ¢° and ¢$°, we observe that v° is only nonzero when 
the e-switch is closed and v’ is only nonzero when the o-switch is 
closed. This fundamental observation’ '’ has opened the door to a 
rigorous understanding of switched capacitor networks and has re- 
sulted in several methods for their analysis. 

One way to interpret the relationship between the even and odd 
topologies is to consider them topologically decoupled, with the states 
of one determining the initial conditions for the other.'* '” This inter- 
pretation results in two distinct circuits coupled together via dependent 
sources which establish the initial conditions mentioned previously. 
This formulation has been found” '’ to be particularly convenient for 
computer-aided analysis. Another interpretation” '’ is to combine the 
even and odd networks topologically into a single z-domain equivalent 
circuit. In general, an n-port sc network’® '’ will require a 2n-port 
equivalent circuit, 1.e., 7-ports for the even clock phase and n-ports for 
the odd clock phase. It is this interpretation that provides the kinds of 
valuable insight that Laplace transform techniques have provided for 
analog linear-time invariant networks. 

Since sc networks can be most rigorously characterized'® '’ in terms 
of charge transfer operations, discrete time voltages u;(k7T') and dis- 
crete time charge variations Aq;(kT) are used as port variables. At the 
switching times kT, charges are instantaneously redistributed with the 
principle of charge conservation maintained at every node in the 
network. It is this principle that allows us to write nodal charge 
equations in the same spirit with which we apply Kirchoff’s current 
law to continuous networks. In general, due to the bi-phase switching 
operation, two distinct, but coupled, nodal charge equations are re- 
quired to characterize the charge conservation condition at a particular 
node for all time instants kT. Namely, one equation for the even 2kT 
times and a second equation for the odd (2k + 1)T times are required. 
These equations are written, for some node p, as 

My, 


My, 
Agi(kT) = ¥ ghil(kT) — Y. ghi((k — 1)T) 
=] i=} 


for k an even integer, (la) 
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and 


Muy 


M,, 
Ag(kT) = ¥ gii(RT) -— XY api(R -1)T) 
i=1 i=] 


for k an odd integer, (1b) 


or equivalently in the z-domain 
Mep Mep 
AQ; (Zz) = » Qpi(z) — 21" > Qpi(Z) (2a) 
and 
M.,» Muy, 
AQn(z) = 2 Qpilz) — eg Y Q(z), (2b) 


where qgpi, gpi and Qi, QM); denote, respectively, the instantaneous 
charges stored on the ith capacitor connected to node p for the even, 
odd kT time instants and their z-transforms. Also M.,, and M.,, denote 
respectively the total number of capacitors connected to node p during 
the even and odd clock phases. 

For single capacitor sc blocks, z-transformed nodal charge equa- 
tions'’’ lead directly to the desired equivalent circuits as described in 
Section III. To characterize a complex sc network one simply substi- 
tutes, one-for-one, the appropriate z-domain block equivalent circuit 
for each sc element in the network schematic. As demonstrated in 
Section IV, transformed nodal charge equations for each node in the 
network are then written by inspection from the equivalent circuit. 
The desired voltage transfer function(s) is then obtained by algebrai- 
cally manipulating these z-domain equations in the usual manner. 


2.2 Sample data waveforms 


It should be noted that there are several sample data waveforms 
that can be modeled as special cases of the waveform depicted in 
Fig. 2. These waveforms and their respective even and odd components 
are shown in Fig. 3. One can immediately invoke the z-transform to 
mathematically describe these waveforms. The return-to-zero wave- 
forms in Figs. 3a and 3b can be expressed mathematically as 


Vulz) = Valz) + Valz), (3a) 
where 
Vil(z) =0 (3b) 
and 
Vi(z) = Vilz) + Vi(z), (4a) 
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Fig. 3—Common special-case sampled data voltage waveforms and their respective 
even and odd components. 
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where 
V3(z) = 0. (4b) 


In a similar manner, we can characterize the full clock period sampled 
and held (S/H) waveforms in Figs. 3c and 3d as 


V.(z) = Ve(z) + Vel(2), (5a) 
where 
Vi(z) = 2" Ve(z) (5b) 
and 
Valz) = Valz) + Valz), (6a) 
where 
Vie(z) =z '?Va(z). (6b) 


If a capacitor of value C is placed across the terminals of the voltage 
source u,(t), we observe that the charge on the capacitor changes in 
value only once per clock cycle, i.e., at the even 2kT' time instants 
when u,(t) changes. At the odd (2k + 1)T time instants v.(t), the 
capacitor voltage is unchanged, thus, the charge remains constant. 
This phenomenon is described analytically, for the even and odd clock 
phases, in the following manner: 


AQ’(z) = CVE(z) — C2" Ve(z) = C(I — 27!) VE(z) (7a) 
and 
AQe(z) = CVe(z) - C27? Ve(z) = 0. (7b) 


It is noted that the condition AQ¢(z) = 0 can also be obtained by 
disconnecting V. from the capacitor with a switch which is open during 
the odd clock phase. Thus, in the sense that no charge is transferred, 
this open circuit condition also implies a full cycle sample and hold | 
operation. Corresponding relations can also be written for the full cycle - 
S/H waveform in Fig 3d. 

It is useful to note that the return-to-zero waveforms U,(t) and vs(t) 
can be obtained by processing u(t) in Fig. 2 with simple switch 
networks as shown in Fig. 4. When the switches in Fig. 4 are ideal, va 
and vu, are ideal, zero-impedance voltage sources with waveforms as 
depicted in Figs. 3a and 3b, respectively. 

The various sample data waveforms considered in this section can 
be generated externally (i.e., an independent voltage source) and at 
any internal node by an appropriate combination of switches and 
capacitors. It is often crucial, particularly at the network output where 
one may either resample or couple to another sc network, to identify 
the waveform type of internal node voltages. From the properties 


736 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


described in this section, this identification process is usually straight- 
forward. 


2.3 SC network transfer function relations 


The voltage transfer function is well recognized as a convenient 
mathematical tool for the analysis and synthesis of continuous, linear, 
time-invariant networks. The value of the voltage transfer function is 
not expected to diminish with sc networks. At this point, it should be 
obvious that sc network transfer functions are most conveniently 
written in the z-domain. 

Let us, for simplicity, confine the discussion in this subsection to 
two-port Sc networks with one input and one output. As noted previ- 
ously, the two-port can be represented by an equivalent four-port, as 
shown in Fig. 5. In general, a 2 X 2 transfer matrix is required to fully 
characterize the input-output relations for this four-port network, 1.e., 


Vout(2Z) = H(z) A(z) Vin (z) (8) 
Vout(Z) A3(z) H4(z) in(Z) 


where, by superposition, 


Vin(Z) = Vin(z) + Vin(z) (9a) 
Vout (2) = Vout (Z) + Vout(Z). (9b) 
v(t) { | v(t) v(t) t | vp (t) 
(a) (b) 


Fig. 4—Switch networks for return to zero voltage generation. 
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Fig. 5—Four-port z-domain equivalent circuit. 
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In many cases, the signal conditioning performed at the input and 
output imposes constraints on the form of the transfer relations. For 
example, consider the application of the return-to-zero source in Fig. 
3a to the sc network in Fig. 5. Substituting Vi;,(z) = 0, obtained from 
eq. (3b), into eq. (8) yields the following transfer relations 


Vou(Z) = Ai(z) Vin (Zz) (10) 
and 
Vou (z) = Hs(z) Vin (Zz). (11) 


Thus, depending on whether v,,.(kT) is sampled at the even 2kT times 
or the odd (2k + 1) T times, the voltage transfer function is either H, (z) 
or H3(z), respectively. However, if Vou is sampled at all kT times, then 


Vou(Z) = (Ai(z) + Hs(z)) Vin(z). (12) 


In general, H\(z) * H3(z); however, they are obviously interrelated. 
Therefore, one is not able to independently synthesize H(z) and H3(z). 
In practice, by appropriately conditioning the input and output 
signals, one can realize an Sc network which is completely character- 
ized by a single transfer function. Equations (10) and (11) describe 
examples of this class of Sc network. One can in principle synthesize 
sc networks of this type directly in the z-domain using digital filter” *” 
synthesis techniques. Several examples of multi-transfer function and 
single transfer function sc networks are provided in Section IV. 


lil. EQUIVALENT CIRCUIT MODELS FOR SC BUILDING BLOCKS 


In this section, multi-port z-domain equivalent circuits will be de- 
rived for several sc building blocks. It has been shown" that any sc 
network can be constructed from the six blocks mentioned in Section 
I and voltage controlled voltage sources. The objective here is to 
facilitate the application of this approach by deriving a library of 
higher order building blocks which, when interconnected, lead to 
canonic z-domain equivalent circuits. sc elements comprised of one 
capacitor and from one to four switches are treated as basic circuit 
elements much like passive R’s, L’s, and C’s in analog circuits. As 
noted in the previous section, the z-domain transfer relations can be 
derived from the equivalent circuit using familiar network analysis 
techniques.” 

Figures 6 and 7 contain listings of the commonly occurring Sc 
elements and their respective z-domain equivalent circuits or building 
blocks. In addition to the Sc building blocks, z-domain models are also 
given for each of the sample data sources discussed in the previous 
section. This library is sufficiently general to accommodate all the 
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DISCRETE TIME CIRCUIT Z—DOMAIN EQUIVALENT CIRCUIT 
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Fig. 6—General library of 2n port z-domain equivalent circuits for switched capacitor 
building blocks (continued on pp 740-743). 
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DISCRETE TIME CIRCUIT Z—DOMAIN EQUIVALENT CIRCUIT 


(e) TOGGLE SWITCHED DIFFERENCER (TSD) 
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(f) SINGLE PHASE GROUNDED CAPACITOR 
(SPGC) 
4q, e,O 


(g) SINGLE PHASE FLOATING CAPACITOR (SPFC) 





Aq, 





4q, 





Fig. 6—(continued). 


published”! '”***" sc networks which use nonoverlapping bi-phase 


switches. The equivalent circuits in Fig. 6 are derived in their most 
general 2n-port form, assuming that all voltages update at one-half 
clock cycle intervals, as per u(t) in Fig. 2. The e, o notation refers to 
the switch phasings as noted in the previous section. Similarly, super- 
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DISCRETE TIME CIRCUIT Z—DOMAIN EQUIVALENT CIRCUIT 


(i) SHORT—-CIRCUIT GROUNDED RESISTOR (SGR) 
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(k) OPEN—CIRCUIT GROUNDED RESISTOR (OGR) 
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Fig. 6—( continued). 


scripts e, o and o, e are used to denote the even or odd port variable 
(V;, AQ;) components and the complement odd or even port-variable 
components, respectively. This e, o notation conveniently provides the 
connectivity information for interconnecting the building blocks. 
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DISCRETE TIME CIRCUIT Z—DOMAIN EQUIVALENT CIRCUIT 











(m) OPEN-CIRCUIT FLOATING RESISTOR 








(OFR) AQe° P Aaa%° 
Aq, e,0 C AG, + 
% Nw —_ 4 e,0 
e,0 
V 4 V5 
y 0,e = = 
1 ~ > — 0,e oe 
Vi V>' = 
oo +O——______—-® o—__—_—_—ot+ 
=— AQ4"= A0%S° =0 
(n) TOGGLE SWITCHED FLOATING aa‘ Aa%° 7 


FOUR-PORT (TSFFP) 


4q, eo 


Wee 
+ 


we 
4a, 


(0) FLOATING BI-VALUED RESISTOR 











(FBR) e,0 AQ®°? C, 4as° 
ass a 
Hi + 
0,e 
Cy vie - vse 
Aq, eo C Aq? = = 
how 1 = - 
+ aN. = a 
o0,e vr" y oe 
vy vo C, 
oe + 
<a omar 
= AQ?* aay? 
(p) GROUNDED BI-VALUED RESISTOR ————<$<— ee 
(GBR) 
Aq, : Aq AQse-o Aas° 
She “WA 





Fig. 6—(continued). 


In practice, there are many Sc networks in which the charges and 
voltages update, due to the internal switching action of the sc network, 
only on full clock cycle intervals. This behavior, which is readily 
identified on a block-by-block basis, results in 2n-port equivalent 
circuits with n open ports. Many of the sc blocks in Fig. 6 fall within 
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DISCRETE TIME CIRCUIT Z—-DOMAIN EQUIVALENT CIRCUIT 


(q) INDEPENDENT VOLTAGE SOURCE (IVS) 
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Fig. 6—( continued). 


this category. When properly interconnected,'’ these 2n-port equiva- 
lent circuits can be reduced to the n-port equivalent circuits in Fig. 7. 
3.1 2n-port SC building block equivalent circuits 


In this section, derivations are given for several of the sc equivalent 
circuits in Fig. 6. These derivations will be based on z-transformed 
nodal charge equations which can be derived by inspection from the 
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DISCRETE TIME CIRCUIT SIMPLIFIED 2~—DOMAIN EQUIVALENT CIRCUIT 


{a) SINGLE PHASE GROUNDED 
CAPACITOR (SPGC) 





(b) SINGLE PHASE FLOATING CAPACITOR 
(SPFC) 





(c) OGR WITH SERIES SWITCH (OGR/SW) 


Aq, eo Aq, Aae.° AQ? 
Nw aN 1 2 
+ + 
e,o 
Vv 
2 
Vv vse C V5° 
1 => 1 2 
C o0,e 
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Fig. 7—Simplified library of n port z-domain equivalent circuits (continued on pp. 745- 
746). 


Sc circuit. As noted in the previous section, one can write a distinct 
nodal charge equation for each switch phase. Therefore an n-port SC 
block is characterized by 2n nodal charge relations. The desired 2n- 
port z-domain equivalent circuit evolves directly from these relations. 
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DISCRETE TIME CIRCUIT SIMPLIFIED Z—DOMAIN EQUIVALENT CIRCUIT 
(f) TOGGLE SWITCHED INVERTER (TSI) 
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Fig. 7—( continued). 


The equivalent circuit for a complex sc network is derived by properly 
interconnecting the appropriate block equivalent circuits. To avoid 
boring the reader with excessive repetition, derivations will only be 
provided for blocks b through f and | of Fig. 6. Once these derivations 
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DISCRETE TIME CIRCUIT 
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Fig. 7—(continued). 


are understood, the validity of the remaining equivalent circuits can 
be established by inspection. The independent and dependent voltage 
source equivalent circuits are obtained directly from the relations in 


the previous section. 
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3.1.1 Floating capacitor equivalent circuit 


One can derive the desired equivalent circuit in a straightforward 
manner directly from the nodal charge equations. In these equations, 
the even and odd voltage components (V° and V”) serve as independ- 
ent variables and the even and odd charge variation components (AQ 
and AQ”) serve as the dependent variables. Since the floating capacitor 
block in Fig. 6b contains no switches, the z-transformed nodal charge 
equations, where V°, V’, V5 and V2 are independent voltage excita- 
tions, are instantly written as 


AQ{°?(z) = 

CV$!(z) — C2? V9" (z) — CVS°(z) + C27!” V3"%(z), (18a) 
AQs°(z) = | 

CV$°(z) — C27? V3" (z) — CViP(z) + C27” VP" (z), (13b) 
AQ? (z) = 

CV}"(z) — C2? VE! (z) — CVS" (z) + C27! V8"(z), (18c) 
and 
AQ?" (z) = 

CV3" (2) — C2“? V5"(z) — CVI (2) + C27? Vi" (z). (18d) 


There are perhaps several circuit interpretations for this set of 
equations. One convenient interpretation is the balanced lattice equiv- 
alent circuit shown in Fig. 6b. Another circuit interpretation”’ for these 
equations is a four-port network comprised of an unbalanced floating 
LTP coupled to the even and odd transmission paths via ideal trans- 
formers. By interpreting eqs. (13) as a balanced lattice, one can 
eliminate the transformers. This balanced lattice is referred to in this 
paper as a balanced floating LTP; in contrast, the Kurth-Moschytz 
circuit is referred to as an unbalanced floating LTP. Both circuits are 
equivalent and valid under all port termination conditions. 


3.1.2 Toggle switched capacitor’ (TSC) equivalent circuit 


Due to the switching action of the toggle switch, the capacitor C 
receives charge from v;, only on the even (odd) times and charge from 
UV, on the odd (even) times. When the switches are open, the corre- 
sponding ports are open and AQ = 0. These observations are consistent 
with the z-transformed nodal charge equations: 


AQi(z) = CV$%(z) — C2? VE"(z) (14a) 
AQ3°(z) = 0 (14b) 
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AQi"(z) = 0 (14c) 
AQ$*(z) = CV3"(z) — C27? V¥"(z). (14d) 


These equations lead directly to the four-port equivalent circuit in Fig. 
6c. As described in Ref. 17, an unbalanced LTP is seen to bridge the 1- 
e,o and 2-0,e ports. Note that ports l-o,e and 2-e,o are always open; 
therefore, no transmission occurs at these ports. This is a property 
common to all bi-phase toggle switched sc blocks (e.g., equivalent 
circuits 6d and 6e in Fig. 6). 


3.1.3 Toggle switched inverter" (TSI) equivalent circuit 


The operation of this circuit is similar to the Tsc element; with the 
exception that in the Ts1 the voltage is inverted as the charge on C is 
transferred from port 1 to port 2. This process is described by the 
following z-transformed nodal charge equations: 


AQS°(z) = CV9°(z) + C2 V3'"(z) (15a) 
AQ?*(z) = 0 (15b) 
AQ2°(z) =0 (15c) 
AQ3"(z) = CV3"(z) + C27? V9°(z). (15d) 


These equations are readily interpreted by the four-port equivalent 
circuit in Fig. 6d. Note for this block the l-e,o and 2-0,e ports are 
bridged by an unbalanced LTp-like network in which the storage 
elements (Cz~'’*) are all premultiplied by (—1). Since this network 
serves both as a link between even and odd transmission paths and as 
a signal inverter, it is referred to as an unbalanced inverting LTP. 


3.1.4 Toggle switched differencer" (TSD) equivalent circuit 


In this element, the charge on C is determined by the voltage 
difference v{°(kT) — vx°(kT) during the e,o switch phase. When the 
o,e switches close this voltage, difference appears directly across port 
3. This operation is described by the following z-transformed nodal 
charge equations: 


AQ$°(z) = CV9°(z) — CVS°(z) — C27” V3"(z) (16a) 
AQ?*(z) =0 (16b) 
AQS°(z) = CVS°(z) — CVT°(z) + C27” V3"(z) (16c) 
AQ3"(z) = 0 (16d) 
AQ3°(z) =0 (16e) 


and 
AQ3“(z) = CV3"(z) — C2? Vi°(z) + C2? VS"(2). (16f) 
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The six-port equivalent circuit representation for these equations is 
given in Fig. 6e. Note that three of the six ports are open. The Tsp 
element exhibits yet another form of LTP. In this element two e,o 
transmission paths are linked to a single o,e path through a differencing 
operation. It is perhaps appropriate to refer to this LTP as an unbal- 
anced differencing LTP. 


3.1.5 Single phase grounded capacitor (SPGC) 


This sc element occurs frequently in sc networks, particularly in 
low-pass Sc filters. In a sense, it serves as a companion element for the 
grounded capacitor in Fig. 6a. It is also a special case of the grounded 
capacitor. The nodal charge relations for this block are readily written 
as 


AQ{"(z) = C(1 — 27')V%"(z) (17a) 
AQS°(z) = C(1 — 2") V%"(z) (17b) 
AQ?*(z) =0 (17c) 
AQ>2"(z) = 0, (17d) 


where V°°(z) = V{¥°(z) = V3°(z). It is noted that one can derive eqs. 
(17) from the grounded capacitor equivalent circuit in Fig. 6a by setting 
AQ?* = AQ3* = 0, which implies V°*(z) = 27'”V%°(z). For the spcc 
block, V°*(z) represents the voltage stored and held on capacitor C 
and no longer refers to port voltages V1 and V2”. As noted in Section 
II, this condition is equivalent to a full clock period S/H. 

Equations (17a) through (17d) lead directly to the four port equiv- 
alent circuit in Fig. 6f. Due to the switches, two of the ports are open 
as described by eqs. (17c) and (17d). This network is equivalent to the 
open circuit LTP described in Ref. 17. The equivalent circuit for the 
floating capacitor is seen to similarly reduce to that in Fig. 6g when a 
series switch is added. Since these blocks occur frequently in complex 
sc networks, their recognition results in substantially simplified equiv- 
alent circuits. 


3.1.6 Open circuit grounded resistor with series switch (OGR/SW) 
— equivalent circuit 


This block performs a function similar to the sGR in Fig. 61, except 
that capacitor C is discharged while it is totally disconnected from the 
circuit. Therefore the shorted capacitor does not load the circuit during 
the discharging switch phase. The equivalent circuit, in Fig. 61, for this 
block is obtained from the following z-transformed nodal charge equa- 
tions 


AQY*(z) = CV"(z) (18a) 
AQi(z) = 0 (18b) 
AQS°(z) = CV*"(z) (18c) 
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and 
AQ3?*(z) = 0, (18d) 
where 
Vo°(z) = Vie(z) = Vo2"(2). 


This block is memoryless, and it is an open circuit during the o,e clock 
phase when capacitor C discharges. Thus during the e,o clock phase, 
the block serves as a resistor and during the o,e clock phase it does 
nothing. In many sc network arrangements, where it is only necessary 
to transmit during one clock phase, this block serves as an excellent 
resistor equivalent. 

Equivalent circuits 60 and 6p represent straightforward, nevertheless 
useful, generalizations of circuits 61 and 6m. These sc networks are 
seen to provide, for the even and odd clock phases, different value 
resistor-like components. This type of component suggests the possi- 
bility of time-sharing capacitors and operational amplifiers to achieve 
different even and odd circuit behaviors. 

This concludes the derivations for: equivalent circuits in Fig. 6. At 
this point, the interested reader should be able to derive the remaining 
Sc equivalent circuits easily. 


3.2 Simplified SC building block equivalent circuits 


In the previous section, it was observed that many of the four-port 
equivalent circuits result in n (of 27) open circuit ports. Obviously, 
any signals applied to one or more of these open ports will neither be 
processed nor transmitted. Therefore, sc networks comprised of these 
blocks will only provide transmission and filtering, when the switches 
are phased such that the blocks interconnect to provide one nonopen 
signal path’ from input to output. Assuming this connection rule, the 
open ports are nonfunctional and can be removed from the equivalent 
circuits. The immediate identification of these blocks in a complex sc 
network results in much labor-saving equivalent circuit simplification. 
More specifically, 27-port equivalent circuits reduce directly to n-port 
equivalent circuits. To emphasize this point, the appropriate four-port 
equivalent circuits in Fig. 6 have been reconfigured as two-port equiv- 
alent circuits in Fig. 7. Many complex sc networks”'' can be modeled 
exclusively with these simplified equivalent circuits. For this class of 
sc networks, circuit analysis is no more complex than that for contin- 
uous (linear) time-invariant networks. Since the blocks listed in Fig. 7 
perform all the necessary network functions, it is expected that one 
can synthesize general z-domain transfer functions using only these 
blocks. This restriction, with little sacrifice in generality, should lead 
to efficient z-domain synthesis procedures for Sc networks. 

In addition to the reconfigured equivalent circuits from Fig. 6, Fig. 
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7 contains two additional building blocks. These blocks are shown in 
Fig. 7h and 7i. Let us briefly discuss each of these blocks on an 
individual basis. | 


3.2.1 Toggle switched floating four-port (TSFFP) equivalent circuit 


This element is the most general of the toggle switched (single) 
capacitor elements. Thus the equivalent circuits for the TSC, TSI, and 
the Tsp can be derived directly from the equivalent circuit in Figure 
7h, by simply shorting to ground the appropriate port or ports. The z- 
transformed nodal charge equations for the block are expressed as 


AQS°(z) = CV¥"(z) — CV5°(z) — C27”? V3"(z) 


+ C27'°V3"(z) (19a) 
AQ ?"(z) = 0 (19b) 
AQS°(z) = CVS°(z) — CVS°?(z) — C27? V3"(z) 

+ C27'/°V3"(z) (19c) 
AQ2"*(z) =0 (19d) 
AQ3°(z) = 0 (19e) 
AQS*(z) = CV3*(z) — CV9*(z) — C27 V5"(z) 

+ C27'"°V$"(z) (19f) 
AQ7°(z) = 0 (19g) 


and 


AQi(z) = CVi*(2) — CV3"(2) — C2? V8"(z) 


+ C2? Vi"(2), oem 


To be completely general, eqs. (19) describe an eight-port equivalent 
circuit with four open ports. Such an eight-port description is shown 
in Fig. 6n. The more useful four-port equivalent circuit in Fig. 7h is 
obtained by deleting the open ports. 

Comparing Figs. 6b and 7h, one observes that the four non-open 
ports of the TSFFP are coupled together via a 90-degree rotated, 
balanced floating LTP. In fact, if the TSFFP is rotated 90 degrees with 
ports 1 and 3 serving as the incoming ports and ports 2 and 4 as the 
outgoing ports, we indeed have the equivalent circuit for the floating 
capacitor in Fig. 6b. If ports 2 and 4 are then shorted to ground, one 
can then easily derive the equivalent circuit for the grounded capacitor 
in Fig. 6a. All the toggle-switched (the Tsc, TSI, and TSD) elements can 
be readily derived from the TsFFP. For example, if ports 2 and 4 are 
shorted to ground, the TSFFP equivalent circuit reduces to that of the 
TSc in Fig. 7e. Also, the TsiI in Fig. 7f is obtained when ports 1 and 4 
are shorted to ground. In summary, by providing the proper termina- 


EQUIVALENT CIRCUITS FOR SC NETWORKS 751 


tion conditions, one can derive the equivalent circuits for any of the 
single capacitor sc elements in Figs. 6 and 7 with the TSFFP. 


3.2.2 Toggle switched blocks driven by full clock cycle S/H voltage 
sources 


Equivalent full-cycle time delays can be experienced when toggle- 
switched sc blocks are driven with full cycle S/H voltage sources. A 
situation of this type is illustrated in Fig. 71. The behavior of this 
circuit can be described in the following manner. When source U.,a(f) 
changes to value veg, the o,e switch is open; thus, the charge on 
capacitor C remains unchanged. One-half clock period later when 
switch o,e closes, capacitor C acquires the charge Cvég. Another one- 
half clock period later, the 0,e switch opens, the e,o switch closes, and 
v-'a appears at the output with a net time delay of one full clock period. 
Obviously, when the source changes value in synchronism with the 
initial o,e switch, the net time delay is one-half of a clock period. The 
SC circuit in Fig. 71 can be modeled according to the equivalent circuit 
in Fig. 8. Writing a nodal charge equation at node 2 yields 


CV59(z) = C21 (21 Vea(z)) = C27! Viealz). (20) 


The equivalent circuit in Figure 71 conveniently characterizes this 
relationship. Similar equivalent circuits can be derived for the TSI and 
TSD blocks as shown in Figs. 9a and 9b respectively. Full cycle time 
delays can readily'' occur when appropriately phased toggle switched 
blocks are driven by operational amplifier integrator circuits. 


IV. APPLICATIONS TO THE ANALYSIS AND SYNTHESIS OF SC NET- 
WORKS : 


In this section, the concepts developed in the previous sections are 
applied to the analysis of several passive and active sc networks. Many 
examples are simple, to emphasize the insight provided by the equiv- 
alent circuits. 


4.1 Passive SC networks 


In this section we examine two single pole passive sc networks. The 
equivalent circuits in Figs. 6 and 7 allow one to examine a given SC 
network under an assortment of input-output conditions, as in Fig. 3. 
As we will see, such an examination can reveal some rather interesting 
circuit behavior that is not immediately obvious. 


4.1.1 First-order, low-pass SC networks 


As the initial example, consider the simple first-order, low-pass 
network depicted in Fig. 10a. An equivalent circuit for this network 
can be obtained by simply cascading blocks 6c and 6f, as shown in Fig. 
10b. This circuit can obviously be reduced to that in Fig. 10c. One 
could have immediately written the equivalent circuit in Fig. 10c by 
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(b) 


Fig. 9—(a) Toggle switched inverter and (b) toggle switched differencer, driven by 
full clock period S/H voltage sources. 


cascading the simplified block equivalent circuits 7a and 7e. Writing a 
single nodal charge equation at node 2, namely, 


(Cy — Caz? + Cyz'? + Co — Caz") Viu(z) = Cz"? Vin(z) (21) 
yields the familiar low-pass z-domain transfer function 


_ Vout (Zz) _ Cz 
Vin (2) Ci + Co — Coz! 


Note that V%, and Vé.. are removed by sampling operations at the 
input and output respectively. Thus, the transmission through this 
network is completely described by a single transfer function, namely 
Hs. 


Ai3(z) (22) 


4.1.2 First-order, high-pass SC network 


The simple first-order, high-pass circuit shown in Fig. 11 is a rather 
interesting circuit, as we shall soon see. Its interesting behavior stems 
from the input-to-output switch free path which permits both Vj, and 
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v. (kT) C Vous (KT) 


in 1 Cy out 





Fig. 10—Single pole “‘passive” sc low-pass network. 


Vi, to determine the e and o components of V,,,,. To study this circuit, 
we write the equivalent circuit in Fig. 11b by cascading blocks 6b and 
6k. 


Analysis of the equivalent circuit yields the following relations: 


C\(1 - 2’) 
ou = ———__— Vin in 23 
Vout (2) Ga Get (z) + OVin(z) (23a) 
= A, (z) Vin(z) (23b) 
and 
oO —C,27 e oO 
Vou(z) = G+0-62" Vin(z) + Vin(z) (24a) 
= H3(z)Vin(z) + Ha(z) Vin (2). (24b) 
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(b) 


Fig. 11—Single pole “passive” sc high-pass network. 








Note: 
Vout (Z) C\(1 — 27’) 
MQ) ET) |va=0°G+G-Get 
is a first-order, high-pass function, while 
Vout (2) —C227'? 
H. A = —— 25b 
at) 7 in(2Z) in = 0 Ch — Cr — Cz’ 





is a first-order, low-pass function. This is a most interesting result, 
indeed. From eqs. (24), we observe that by forcing V?,, = 0, as in Fig. 
12a, this circuit behaves like a first-order, high-pass filter when the 
output is sampled on the even times and behaves like a first-order, 
low-pass filter when the output is sampled on the odd times. A circuit 
that achieves this bifunctional characteristic is shown in Fig. 12b. To 
achieve this behavior, a simple return-to-zero source of the form shown 
previously in Fig. 3a, or equivalently in Fig. 4a, is used to drive the 
high-pass circuit depicted in Fig. 11. 


4.2 Active SC networks 


Due to obvious reasons, much of our interest is in active SC 
: 8-11,26,27 ‘ : 
networks. These networks are typically comprised of capacitors, 


8-11,26,27 
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(b) 


Fig. 12—Single pole “passive” sc high-pass/low-pass network. 


switches and operational amplifiers. Many of the active sc networks 
appearing” '' in the literature are comprised of simple sc building 
blocks, of the form listed in Fig. 7, buffered by operational amplifiers. 
When these operational amplifiers can be assumed to be ideal, the 
virtual grounds result in further simplifications in the equivalent 
circuits. In the voltage-charge domain, a “virtual” ground at the input 
of an ideal operational amplifier shall be defined by the condition AQ 
= 0, V = 0. Let us now consider the equivalent circuit representations 
for the following selection of first-order active sc networks. 


4.2.1 Lossless integrator 


The equivalent circuit for the lossless integrator in Fig. 13a is 
derived, in full generality, using blocks 6b, 6c, and 6u as shown in Fig. 
13b. Of course, one may accommodate the finite gain of an actual 
operational amplifier using the voltage-controlled voltage source, cited 
in Fig. 6 as block 6t. The rather unwieldy circuit depicted in Fig. 13b 
can be immediately simplified by removing all elements shunting 
virtual ground points and voltage sources. This network is then re- 
drawn in the form shown in Fig. 13c, which can be again reconfigured 
to yield the circuit in Fig. 13d. Finally, the second stage of Fig. 13d is 
noted to be a voltage-controlled voltage source with B = z7'” as in 
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Fig. 13—Active-sc integrator. 


Fig. 8. The final equivalent circuit in Fig. 13e implies that the lossless 
integrator could have been derived directly from the simplified equiv- 
alent circuits in Fig. 7. 

The transfer functions for the lossless integrator are then readily 
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Fig. 14—Active-sc lossy integrator with Tsc damping. 


determined to be 


Veulz) _ — (Ci/C2)2-¥ 


A3(z) = Vez) rs i-2= (26a) 


and 


Voulz) = — (C\/C2)27’ 
A(z) = Ve. eee (26b) 
Note that when the output of the lossless integrator is sampled at the 
odd times, the transfer function is H; and when sampled on the even 


times the transfer function is Hj. 
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4.2.2 Lossy integrator with TSC 


As should be expected, the equivalent circuit for the lossy integrator, 
shown in Fig. 14a, is similar to that derived for the lossless integrator. 
To shift the pole to the left of z = 1 + j0, a toggle-switched capacitor 
(Tsc) has been placed across the feedback capacitor. Obviously, the 
intent is for the Tsc to play a role comparable to a resistor in active-RC 
lossy integrators. To analyze this network, let us first derive the 
equivalent circuit. This equivalent circuit can be derived step by step, 
as was done for the lossless integrator and shown successively in Figs. 
14b and 14c, respectively. The final equivalent circuit in Fig. 14c, like 
that in Fig. 13e, can be readily derived from the simplified equivalent 
circuits in Fig. 7 by direct substitution. This result simplifies tremen- 
dously the equivalent circuits for complex, high-order active sc net- 
works. 

The transfer functions for the lossy integrator are then readily 
obtained from the circuit in Fig. 14c, namely, 





— Veu(2z) (C1 /C2) 27" 
N= yey To = (C/Cne te) 
and 
e _ = | ; 
H(z) a Vout(Z) _ (C\/C.)z (27b) 





Viz) 1—[1—(C3/G)Jz" 


It is interesting to examine H(z) for different values of C3. Consider 
the following three conditions C; = C2, C3 = 2C2, and C3 > 2C2. For C3 
= C2: 


C 
H(z) = — G2” (28) 
an ideal half-delay element. For C; = 2C,, 
—(C,/C2)27'” 
H(z) oe, (29) 


— and the circuit is no longer stable. Finally, when C; > 2C, the pole of 
H3(z) lies outside the unit circle and the circuit is clearly unstable. 
Obviously, the Tsc is much more than a resistor; a point which has 
been illustrated’ in other ways. 

Let us look briefly at the effect of alternating the phases of the 
switches which make up the feedback Tsc. This sc network along with 
its equivalent circuits are shown in Fig. 15. The transfer functions are 
then readily written 

Virut(Z) —(Ci/Cy)27'” 


Yay T= [1 (G/GaE Ne 
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and 


Vou(Z) _ —(C\/C2)[ (C2 — C3)/Co]27! 
Vin(z) 1 — [1 — (C3/C2)]27' 


Comparing the pole location for H(z) in eqs. (27) and (380), one 
observes that the TSc switch phasing has no effect on this parameter. 
However, the de gain for the even component V,,. is altered by the 
factor (C2 — C3)/Co. It should be noted that these observations were 
not totally expected. 





A(z) = (30b) 





(c) 


Fig. 15—Active-sc lossy integrator with Tsc damping. The switches of the feedback 
TSC are phased opposite to that shown in Fig. 14. 
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4.2.3 Lossy integrator with OFR 


Another lossy integrator realization is shown in Fig. 16a. Of perhaps 
only theoretical interest is the comparison of the behavior of this 
circult with its counterpart in Fig. 14a. The equivalent circuit, shown 
in successive stages of simplification in Figs. 14b and 14c respectively, 
can be derived directly from the simplified block equivalent circuits in 
Fig. 7. The transfer functions for this circuit are 


— Voulz) — [-Ci/(C2 + C3)J20'” 


a) Vi(z)  1—[Co/(Co + C3)]z7 


(31a) 


and 


Vout (2) = [—C,/(C2 + C3)]2! 
Vin(z) 1—[C2/(C, + Cs)J27 
The transfer functions expressed in eqs. (31) are seen to be truly 


representative of lossy integrators. H3(z) and Hi(z) are absolutely 
stable for all finite values of Ci, Co and C3. 


M(z) = (31b) 





4.2.4 Bilinear lossless integrators*® 7’ 


Previously, in Section 4.1.1, an integrator was analyzed which 
achieved integration in the sample-data sense according to the trans- 
formations” 


1 Ze 

- toe one 
or 

1 ae 


It is well known” ** that eq. (32b) only adequately approximates the 
function of an analog integrator for frequencies satisfying wr < 1. 
Although eq. (32a) overcomes” this difficulty, it cannot always be 
rigorously applied.’’”’ An accurate and mathematically convenient 
integrator implementation is obtained via the well-known bilinear 
transform *” 7° 78 





ee ae 
s 


21-27) 23) 


There are several ways bilinear integration can be realized*”*’ with 
active-sc networks, as demonstrated in Figs. 17, 18, and 19. It 1s 
interesting to examine the behavior of each of these circuits. 

Let us initially consider the bilinear integrator” shown in Fig. 17a. 
The z-domain equivalent circuit, obtained by interconnecting blocks 
6b, 6c, 6m, and 6u, is shown in Fig. 17b. By straightforward nodal- 
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Fig. 16—Active-sc lossy integrator with orr damping. 


charge analysis, the following relations are easily obtained: 


—(C,/C2)27'” C1/C. 
out re SA Vin — ow) Vin (34a) 
1-2 | l-2z 
and 
Vou = a outs (34b) 


From eq. (34a), we observe that the desired bilinear integration is only 
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obtained when 
Vie VG. (35) 
Substituting eq. (35) into eq. (34a) yields the desired result 


H(z) = Vit = OCCA + 27) 


Vi; 1-2! om 


also, 


e —1/2 —1 
TE ene (36b) 
Vin l-2z 
In summary, this circuit, with the switches phased as shown in Fig. 
17a, will provide bilinear integration only when the input and output 
are sampled at the odd (2k + 1)T times and the input is held for the 
entire clock period. 

A second bilinear integrator realization’’ is shown in Fig. 18a. The 
z-domain equivalent circuit, shown in Fig. 18b, is obtained by inter- 
connecting blocks 6b, 6e, and 6u. The transfer relations for this circuit 
are readily determined to be 


— =(C1/C2) (1 + 274) 16 /C)2/ 


oa (37a) 
ra 


oO 
out —™ 


(l1—27') 1- 
and 
= al 9 yy l/2 
pe ON eT ye OO ym) 
(l-—2z) 1-z 


The output, sampled at all (both even and odd) kT times, is obtained 
by summing eqs. (37a) and (37b) according to eq. (9) and cancelling 
the common factor (1 + 27”); Le., 
Vout Wout + Wout = —(Ci/C2) (1 + Ze) 

A= ye eg NE 
Comparing equations (38) and (36a), we see that the effective sampling 
rate has been doubled with the circuit in Fig. 18a. Also, bilinear 
integration is obtained independent of the input sampling conditions. 
It is noted that sampling the output of this circuit at only the even 
2kT times or only the odd (2k + 1)T times will result in an erroneous 
output. 

A third bilinear integrator is obtained by simply deleting one TsSD 
from Fig. 18a, as shown in Fig. 19a. It is interesting to analyze this 
circuit and compare the results with that given in eqs. (37) and (38) for 
the integrator in Fig. 18a. The z-domain equivalent circuit in Fig. 19b 
is readily obtained by deleting the appropriate elements from the 
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(b) 


Fig. 17—Active-sc bilinear integrator (Copeland, Ref. 26). 


equivalent circuit in Fig. 18b. The transfer relations for this circuit are 


— =(C\/C2)(1 + 27) 


Sy = (39a) 
and 
—2(Ci/C.)2 
plil > ee Vin $ (39b) 
beet 


Summing eqs. (39a) and (39b) yields 


—-1/2 
Vout = Vou + Vou = See Vin. (40) 
l= 

Note that bilinear integration is obtained when the output is either 
sampled at the even 2kT times or at all (both even and odd) kT times. 
Again, bilinear integration is obtained independent of the input sam- 
pling conditions. This concludes the first-order active sc network 
examples. It should be noted that circuits similar to those in Figs. 13 
through 16 could have been derived using the TsI and TSD elements 
shown in Figs. 6d and 6e or Figs. 7f and 7g. To further illustrate the 
procedure, the equivalent circuit for the fourth-order low-pass, leap- 
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(b) 


Fig. 18—Active-sc bilinear integrator (Temes and Young, Ref. 27). 


frog, active-sc filter, depicted in Fig. 20a, is given in Fig. 20b. This 
equivalent circuit is readily derived from the equivalent circuit blocks 
in Fig. 7 and the principles discussed in this section. The equivalent 
circuit in Fig. 20b is seen to be no more complex than the equivalent 
active-RC circuit. Note that, in the absence of the output even switch, 
the only modification to the z-domain equivalent circuit in Fig. 20b is 
an additional voltage-controlled voltage source at the output. This 
voltage-controlled voltage source defines the relation Vi. = 
z/*Vé.2, as shown previously in Fig. 14c. The verification of this 
circuit 1s left as an exercise for the reader. 


V. CONCLUDING REMARKS 


The powerful tools which we commonly refer to as network or circuit 
theory have been indispensable in advancing the analog filter art to its 
present level of quality and sophistication. Such fundamental concepts 
as transfer functions, poles, and zeros have provided the filter designer 
with quick insight as to the behavior of a given filter. He can then 
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(b) 


Fig. 19—A reduced version of the bilinear integrator in Fig. 18. 


efficiently design the filter by relating pole-zero movements to specific 
circuit elements. It is interesting, in this era of high-speed computers 
and sophisticated analysis programs, that many of the classical net- 
works tools still maintain their important role in filter design. The 
objective of this work has been to make tools'® '’ of this kind more 
accessible to the designers of switched capacitor filters. 

In keeping with this goal, a comprehensive library of building-block 
equivalent circuits has been given. This library extends that given in 
Ref. 17 by providing equivalent circuits for higher order sc elements 
and a variety of sampled data sources. These sc elements, typically 
comprised of one capacitor and from one to four switches, serve as 
circuit elements for sc networks much in the spirit in which resistors 
and capacitors serve analog circuits. Viewing switched capacitor ele- 
ments in this way facilitates the derivation of canonic z-domain equiv- 
alent circuits for complex sc networks and the application of classical 
networks tools to their analysis and synthesis. 

In deriving the equivalent circuits, it is pointed out that there are 
several interpretations and types of link two ports or LTPs. The 
interpretation of the floating LTP as a balanced lattice network, in lieu 
of an unbalanced structure, results in the elimination of two trans- 
formers in the equivalent circuit for a floating capacitor. In addition, 


766 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


Z9Z SHYOMLAN OS YO4S SLINDYIO LNAIVAINDA 





(b) 


Fig. 20—Fourth-order, low-pass, leapfrog active-sc filter. 


Vour (kT) 


out 


unbalanced inverting and differencing LTPs are also identified. An 
equivalent circuit is provided for a general toggle switched floating 
four port (TSFFP) element. The functioning ports are shown to be 
linked via a 90-degree rotated, balanced floating LTP. By applying the 
appropriate termination conditions to the TSFFP, its equivalent circuit 
can be used to derive the equivalent circuit for any single capacitor 
element in the library. 

Several examples were worked out and discussed illustrating the 
case of application of the proposed equivalent circuits and the insight 
gained. Most interesting was the novel circuit depicted in Fig. 12 which 
exhibits a bifunctional capability; namely, a high-pass function at the 
even (2kT) times and a low-pass function at the odd (2k + 1)T times. 
The application to active-sc networks of various complexities was also 
discussed. Particularly noteworthy were the various subtle differences 
among the circuits shown in Figs. 17, 18, and 19, all professed to realize 
bilinear integration. 


VI. ACKNOWLEDGMENTS 


The author is grateful to J. J. Friend, T. M. Chien, and P. E. 
Fleischer for their helpful comments, discussions, and words of en- 
couragement as this work evolved. I would also like to thank R. L. 
Adams and C. F. Kurth for carefully reading the initial manuscript 
and providing valuable constructive criticisms. 


REFERENCES 


1. R. Rao and S. Srinivasan, “Low Sensitivity Active Filter Using the Operational 
Amplifier Pole,” Proc. IEEE (1974), pp. 1713-1714. 

2 R. Schaumann, “Low Sensitivity, High Frequency, Tunable Active Filters Without 
External Capacitors,” IEEE Trans. on Circuits and Systems, CAS-22 (1975), pp. 
39-44. 

3. R. Schaumann, “On the Design of Active Filters Using only Resistors and Voltage 
Amplifiers,” AEU-Elec. Commun. 30 (1976), pp. 245-252. 

4. K. R. Laker, R. Schaumann and J. R. Brand, “Multiple Loop Feedback Active-R 
Filters,” Proc. IEEE International Symposium on Circuits and Systems, April 
1976, pp. 279-282. 

. J. R. Brand and R. Schaumann, “Temperature Compensation of Active R Filter 
Parameters,” 1977 Midwest Symposium on Circuits and Systems, August 1977. 

. K. S. Tan and P. R. Gray, “High Order Monolithic Analog Filters Using Bipolar/ 
IFET Technology,” IEEE International Solid State Circuits Conference, February 
1978, pp. 80-81. 

7. D. L. Fried, “Analog Sample-data Filters,” IEEE J. Solid State Circuits, SC-7 
(August 1972), pp. 302-304. 

8. I. A. Young, P. R. Gray, and D. A. Hodges, “Analog NMos Sampled-Data Recursive 
Filter,” IEEE International Solid State Circuits Conference, February 1977, pp. 
156-157. 

9. B. J. Hosticka, R. W. Broderson, and P. R. Gray, “mos Sampled Data Recursive 
Filters Using Switched Capacitor Integrators,’ IEEE J. Solid State Circuits, SC- 
12, No. 6 (December 1977), pp. 600-608. 

10. J. T. Caves et al., “Sampled Analog Filtering Using Switched Capacitors as Resistor 
Equivalents,” IEEE J. Solid State Circuits, SC-12, No. 6 (December 1977), pp. 
592-599. 

11. G. M. Jacobs, “Practical Design Considerations for Mos Switched Capacitor Ladder 
iron Memorandum No. UCB/ERL M77/69, University of California, Berkeley 

1977). 


ao oo 


768 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1979 


12. 


13. 


14. 


15. 


Y. P. Tsividis and P. R. Gray, “An Integrated NmMos Operational Amplifier with 
Internal Compensation,” IEEE J. Solid State Circuits, SC-11, No. 6 (December 
1976), pp. 748-753. 

J. L. McCreary and P. R. Gray, “All-mos Charge Redistribution Analog-to-Digital 
Conversion Techniques—Part I,” IEEE J. Solid State Circuits, SC-10 (December 
1975), pp. 371-379. 

R. D. Baertsch, W. E. Engeler, H. S. Goldberg, C. M. Puckette, and J. J. Tiemann, 
“The Design and Operation of Practical Charge Transfer Transversal Filters,” 
IEEE Trans. on Electron Devices, ED-23 (February 1976), pp. 133-142. 

C. H. Sequin, M. F. Tompsett, P. I. Sucin, D. A. Sealer, P. M. Ryan, and E. J. 
Zimany, “Self-Contained Charge-Coupled Split Electrode Filters Using a Novel 
Sensing Technique,” IEEE J. Solid State Circuits, SC-12 No. 6 (December 1977), 
pp. 626-632. 


. C. F. Kurth and G. S. Moschytz, ‘“‘“Nodal Analysis of Switched Capacitor Networks,” 


to be published in IREE Trans. On Circuits and Systems, February 1979. 


17. C. F. Kurth and G. S. Moschytz, “T'wo-port Analysis of Switch-Capacitor Networks 


Using Four-port Equivalent Circuits,” to be published in IEKE Trans. on Circuits 
and Systems, March 1979. 


18. D. L. Fraser, Jr. and A. J. Vera, “Design Tools for Switched Capacitor Filters,” Cccp 


Signal Processing Workshop, New York, N.Y., May 1978. 


19. P. E. Fleischer, “Computer Analysis of Switched Capacitor Networks in the Fre- 


quency Domain,” unpublished work. 

. Y. Tsividis, “Switched Capacitor Network Analysis,” ccb Signal Processing Work- 
shop, New York, N.Y., May 1978. 

. M. L. Liou and Y. L. Kuo, “Exact Analysis of Switched Capacitor Circuits with 
Arbitrary Inputs,” unpublished work. 

. A. V. Oppenheim and R. W. Schafer, Digital Signal Processing, Englewood Cliffs, 
N.J.: Prentice-Hall, 1975. 


23. T. Fjallbrant, “Digital Filters with a Number of Shift Sequences in Each Pulse 


24 


25 
26 


27 
28 


Repetition Interval,” IEEE Trans. on Circuit Theory, CT-17, No. 8 (August 1970), 
pp. 452-455. 

. J. R. Ragazzini and G. F. Franklin, Sampled-Data Control Systems, Chapter 3, New 
York; McGraw Hill, 1958. 

. C. A. Desoer and E. 8. Kuh, Basic Circuit Theory, New York: McGraw-Hill, 1969. 

. M. Copeland. “The Bilinear Transform Switched Operator,” ccp Signal Processing 
Workshop, New York, N.Y., May 1978. 

. G.C. Themes and I. A. Young, “An Improved Switched Capacitor Integrator,” Elec. 
Lett. 14, No. 9 (April 27, 1978), pp. 287-288. 

. L. T. Bruton, “Low Sensitivity Digital Ladder Filters,” IEEE Trans. on Circuits and 
Systems, CAS-22, No. 3 (March 1975), pp. 168-176. 


EQUIVALENT CIRCUITS FOR SC NETWORKS 769 


Copyright © 1979 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 58, No. 3, March 1979 
Printed in U.S.A. 


Determination of the Basic Device Parameters 
of a GaAs MESFET 


By H. FUKUI 
(Manuscript received July 28, 1978) 


This paper describes a new technique to determine the basic 
properties of the active channel of a gallium arsenide (GaAs) metal- 
semiconductor field effect transistor (MESFET). The effective gate 
length, channel thickness, and carrier concentration are determined 
from dc parameters. A precise method of measuring the dc parameters 
is also given. The new techniques are demonstrated using a wide 
variety of sample devices. It is also shown that microwave perform- 
ance parameters, such as the maximum output power and minimum 
noise figure, are well predicted by dc parameters. Calculated values 
of the intrinsic and extrinsic dc parameters, using simple analytical 
expressions developed in terms of the geometrical and material 
parameters of a device, are shown to be in excellent agreement with 
their measured values. These expressions can be used as a basis for 
device design. 


l. INTRODUCTION 


In a gallium arsenide (GaAs) metal-semiconductor field effect tran- 
sistor (MESFET), the properties of the active channel are fundamental 
in describing its operation. The channel properties can be characterized 
by the four basic parameters: gate length, gate width, channel thick- 
ness, and channel doping. 

In a recent paper,’ the maximally obtainable value of channel current 
was defined as the maximum channel current, J,,. It was pointed out 
that J, differs from either (fully open channel) saturation current, J,, 
or zero-gate-bias drain current, which is often referred to as Lass. 
Currents J, and Ig,;; have conventionally been used to show upper 
limits of the drain current capability. However, neither J, nor Jg,; can 
represent the maximally obtainable value of channel current. It was 
emphasized that [,, plays an important role in determining the maxi- 
mum capability of large-signal operation of the device. Simple expres- 
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sions for J, were then obtained in terms of the four basic channel 
parameters, as a result of an extended study of Shockley’s gradual 
channel approximation’ on Grebene-Ghandhi’s two-section FET model? 
with Fukui’s concept on the current limiting mechanism. ' 

Among the four basic channel parameters, the total gate width, Z, is 
usually a given factor or merely a scaling factor. Therefore, the other 
three parameters are noted to be the most crucial variables in the 
design work. For these three parameters, their effective values were 
adopted in Ref. 1. This was essential, especially for gate length. The 
effective gate length, L, may be either shorter or longer than the 
physical length of gate metallization, L,, depending upon the gate 
junction topography. The channel thickness, a, and carrier concentra- 
tion, N, represent their effective values in the active region of the 
channel. 

In Ref. 1, practical expressions for the zero-gate-bias channel current, 
I,, were also developed as functions of the basic channel parameters 
N, a, L, and Z, and an additional parasitic parameter of source Series 
resistance, R,. An approximate expression for the knee voltage, Vz, 
corresponding to J,, in the drain I-V characteristic, was given by a 
combination of N, a, L, Z, R,, and Ra on a semi-empirical basis, Ra 
being the drain series resistance. In addition, it has been known that 
the total pinch-off voltage, W,, is determined by the Na? product and 
that W, is equal to the sum of terminal pinch-off voltage, V>, and 
Schottky-barrier built-in voltage, V,. 

It is now conceivable that N, a, and L may be determined from the 
measured values of In, I, Vi, Vo, Vo, Rs, and Ra, provided that Z is 
known. The prime purpose of this paper is to present a new technique 
for carrying out this work. Throughout the paper, a transistor curve 
tracer is exclusively used as the tool necessary for measuring the dc 
parameters. However, test equipment of other types with the equiva- 
lent functions can, of course, be used as well. 

There has been a common practice in which either a or N is 
determined from V,, after knowing either N or a, respectively, and 
assuming an appropriate value of V,. Also, Fair showed that an 
iterative analysis on J,, R,, and transconductance, g,,, makes it possible 
to determine N and a from known values of J,, V,, and terminal 
transconductance, gin.’ However, as far as the author knows, there has 
been no published report on an evaluation technique for the effective 
gate length of a finished device. This paper presents such a technique 
as well as simultaneous determination of N and a, from known values 
of t OPR 13 Vp, Vo, Veg, R,, and Rg. 

The second purpose of this paper is to show prediction of the 
microwave performance parameters, such as the maximum output 
power and minimum noise figure, from the dc parameters and hence 
the basic channel parameters. To predict the minimum noise figure, 
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the values of g,, and R,, which is the gate series resistance, must be 
known. Therefore, the determination of g,, and R, are also described 
in this paper. Once the detail of the structure outside the gate channel 
is given, the parasitic parameters, such as R,, Rs, and Ra, can be 
analytically expressed in terms of the geometrical and material param- 
eters of the corresponding sections of the device. The validity of such 
expressions is then examined with experimental results in this paper. 


il. PRINCIPLE OF NEW TECHNIQUE 
2.1 Analytical expressions for device dc parameters 
To determine the basic channel parameters of a GaAs MESFET from 
the measured values of its dc parameters, expressions showing their 
relationships are essential. It has been well known that W, and J, are 
given by 
_ qNa* 


2KEo 





(1) 


Pp 


and 
I, = qu;NaZ, (2) 


respectively,’ in which gq is the electronic charge, € is the permittivity 
of free space, x 1s the specific dielectric constant, and v, is the saturated 
velocity of electrons in n-GaAs. Substituting g = 1.60 x 10°" C, @ = 
8.85 X 10° F/cm, « = 12.5 for GaAs, and a best fit value of v, = 1.4 
x 10’ cm/sec (Ref. 1) into (1) and (2) yields the following practical 
expressions: 


W, = 7.23Na* = V,+ Vo (V) (3) 
and 
I, =0.224ZNa (A), (4) 


where N is in units of 10'° cm™”, a is in pm, and Z is in mm. 
As mentioned earlier, analytical expressions for J, [, and Vis have 
been derived in Ref. 1. First, an expression for J,, is given in the form 


In = BI, (5) 


where £ is the maximum channel opening factor. Parameter £ is 
expressed approximately as 


0.18 /L 
= 1—-— ,/—= 6 
pe1-—— |= (6) 
provided that a best-fit value of 0.29 <x 10* V/cm is assumed for the 
critical electric field, E.. Another approximate expression for J, is 
shown as 
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Im  0.18N*?a!° 
ZF = Ty (A/mm). (7) 
Second, an expression for J, is given in the form 
I, = yl, (8) 
where 
yz lto-—v6+20+0° (9) 
Vy, + 0.23841 
§ = —______—_ (10) 
Wp 
and 
0.0155 R,Z 


Another approximate expression for y is given by 








1 IRs 
= [1- Fell - sah i 
where 
W, = W,+ Ve. (13) 


In (13), V. is a correction voltage that may vary from zero to a few 
tenths of a volt, depending upon the configuration of the channel 
structure. No analytical form is presently available for V.. 

Third, an expression for Vz; 1s given by 


Vig = (1 — BY? W, + I(Re + Ra) + Ve, (14) 
where J; is the maximum value of total forward drain current, including 
the leakage current through the buffer layer and substrate. 

2.2 Determination of basic channel parameters: Case | 


If any one of the basic channel parameters N, a, and L is known, the 
other two can be determined from known values of V,, Vs, and (Im/Z) 
in a straightforward manner, using either a set of (3), (4), (5), and (6), 
or (3) and (7). For example, if the N value is known as a result of the 
epitaxial layer evaluation, a is readily determined by 


Vo + Vo 
a= Vao3sN (um), (15) 


as is well known. The maximum channel opening factor is evaluated 
as 


ie 12In/Z 
VN(V, + Vo) 
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(16) 


Using this B value, L is determined by 
L = 4.27(V, + V,)(1 — 8)? (um). (17) 


In the case that Z (or a) is known, similar evaluation for N and a (or 
N and L) can also be carried out as well. 

Among dc parameters, [,, and W, (= V, + V,) can be considered to 
be primary, because they are determined only by the active channel 
properties which are intrinsic. Other dc parameters, such as Jp and V;,, 
are secondary, since they are affected by extrinsic elements outside 
the gate channel region. The basic channel parameters can be deter- 
mined only from the primary dc parameters, if any one of the three 
basic parameters for the device under evaluation is known. This is a 
characteristic of case [. 


2.3 Determination of basic channel parameters: Case Il 


In this case, none of the three basic channel parameters is known. 
For the determination of these parameters, the secondary dc param- 
eters play the major role and the primary dc parameters remain 
auxiliary. There are two ways to determine the basic channel param- 
eters in this category. 

The first method is based on parameter Jp. Rearranging (8) and (12) 
yields 


pw XMot Vo+Vel 5 fp PRs (18) 
; Rs JVo4 Viet Ve 


After knowing J,, N and a can be determined from (3) and (4), 
respectively, as follows: 


1 TOU eee ce 
a, Fd lca aa 
and 
Z 
a = 0.031 (V, + Vo) 7 (uum). (20) 


Now (17) can be used with (5) to obtain L as 
I 2 
L = 4.27 (V, + val 1- 2 (um). (21) 


The second method is to utilize parameter V;,;. Rewriting (14) yields 


Ver = (Rs + Ra) -— V~ 
=li- ———_- 
B V Vie Vv: (22) 


After knowing the 8 value, N and a can be determined by (19) and 
(20), respectively, provided that J, = I,/B. The L value can be directly 
determined from (14) and (17) as 
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L = 427 [Vis — IRs + Ra) — Ve] (um). (23) 


The first method demands the known values of Z, Vp, Vo, Ve, Io, Im, 
and R,, whereas the second method requires Z, V,, Vs, Vi, Ver, Ip, Rs; 
and Raz already known. In the computation process of determining N, 
a, and L, the subtraction of two major terms is included in both cases. 
Therefore, chances of introducing an intolerable error are inevitable. 
Thus, taking only a single method is not advisable. The results ob- 
tained from one method have to be checked with the other method. 
The simplified relationship given in (7) could be conveniently used as 
a guide to examination and adjustment. Expressions (8) through (11) 
for I, could also be applied for an additional checking of the determined 
values of N, a, and L, in comparison with the directly measured value 
of I,. Some adjustments on temporarily determined values of the 
parameters are often necessary to reach their most probable values. 


Il. MEASUREMENTS OF DC PARAMETERS 


As previously mentioned, a transistor curve tracer is used as the test 
instrument in this paper. A good calibration of the measuring system 
is essential. Not only the curve tracer but also the test fixture must be 
taken into consideration. For example, lead resistances may introduce 
an intolerable measuring error in large-size devices. An excessive 
leakage current may mislead the determination of junction parameters. 
Instability and/or relatively low-frequency oscillation, often taking 
place in a high-performance device, are an annoying phenomena and | 
require a special skill to suppress. 

In the following sections, measuring methods for the dc parameters 
of a GaAs MESFET are described. Although some methods have been 
known or are easily derived from known methods, they are included 
with brief descriptions for completeness. The ideality parameter of a 
gate junction, n, and the open channel resistance, R., are needed 
neither for determination of the basic channel parameters nor for 
prediction of the microwave performance parameters. Nevertheless, 
they are secondarily obtained in the course of determination of the 
primary parameters. As they may be relevant to a further study of the 
device, their determination is also described. 


3.1 Determination of gate barrier built-in voltage and ideality parameter 


As is well known,” the forward current density, J, of a Schottky 
barrier junction for V > 3kT/q is approximately written as 


J=A'T- exo| - Tr Jexo| 2 (24) 


where A”* is the effective Richardson constant, T is the junction 
temperature in °K, & is the Boltzmann constant, n is the ideality 
parameter, and V is the forward bias voltage. 
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The extrapolated value of current density to zero bias gives the 
saturation current density, J,. The barrier built-in voltage is then 
obtained from 





* 72 
ee in| * r | (25) 
q J 
The ideality parameter is given by 
_q <aV 
"RP alin J)’ ) 


Figure 1 is a multi-exposed photograph of the forward I-V charac- 
teristic of a gate junction at room temperature. As described in the 
caption, each curve corresponds to a current range in a decimal step 
for several orders of magnitude. In the highest current range, the gate 
was against three different connections, i.e., source and drain com- 
bined, source alone, and drain alone, in order to differentiate Rz, Rs, 
and Ra from each other later. 

The J-V characteristic given in Fig. 1 1s plotted as shown in Fig. 2. 
At high values of the gate bias, V,, the gate current, J,, tends to 
saturate due to the series resistance effect. At low values of V,, J, is 
often disturbed by a leakage current component around the gate 





Fig. 1—A multi-exposed photograph of the forward I-V characteristic of a gate 
junction at room temperature. Each curve corresponds to a current range of decimal 
step. In the highest current range, three different ground connections are taken against 
the gate. They are the source and drain connected together, source alone, and drain 
alone. For all lower current ranges, the drain is connected to the source. The horizontal 
scale is in units of 0.1 V/div. The vertical scales are, left to right, 1 nA/div, 10 pA/div, 
100 pA/div, 1 mA/div, 10 mA/div (source and drain together), 10 mA/div (source only), 
and 10 mA/div (drain only). 
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ALUMINUM GATE AT 
ROOM TEMPERATURE 


Vi, = 0.785 V 


GATE CURRENT Izy (A) 





0.4 0.5 0.6 0.7 0.8 0.9 1.0 
GATE FORWARD-BIAS VOLTAGE Vg (V) 


Fig. 2—The forward J-V characteristic of an aluminum gate diode at room tempera- 
ture. The slope and the location of its linear portion on a semi-log paper give the ideality 
parameter, n, and the gate built-in voltage, Vs, respectively. 


periphery or with the package. In the middle range where the log J, vs 
V, characteristic is linear, two gate biases, Vim) and Vyyn—1) in V, can 
be chosen corresponding to J, = 10” and I, = 10” in A, respectively. 
Usually, m takes a negative value. 

If the effective mass of electrons in n-GaAs were taken into account, 
the effective Richardson constant would be 8.7 A/cm?/°K at room 
temperature.’ Expression (25) for V, can then be reduced to the 
following practical form at room temperature: 





V, = 0.768 — 0.06 log J, (V), (27) 
in which 
J, = au (107~* A/cm’) (28) 
a A 
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] 
y= 12497 — (29) 
1= Vgim-1) 


Vgc) 


and L, is in units of ym and Z is in mm. A formula to be used for 
deriving n is deduced from (26) as 


n= 16.8[ Vim) = guns (30) 


3.2 Determination of pinch-off voltage, active channel resistance, and 
parasitic series resistances 

Figure 3 shows the drain I-V characteristics in the so-called linear 
region. The characteristics were taken with the lowest scale of Vu, on 
the curve tracer, in which Vz, was the drain-source bias voltage. Each 
characteristic corresponds to a gate-source bias voltage, V,z;. The drain 
current, Ia, at Va, = 0.05 V is then plotted as a function of V,, as shown 
in Fig. 4. The terminal pinch-off voltage, V,, can temporarily be 
determined by an extrapolation of the plot to the abscissa. 

The current shown in Fig. 4 can now be converted into the resistance 
value as Ra, = Ia/Va;. Such a resistance for Vaz; = 0.05 V is shown in 
Fig. 5 as a function of V,, Vz, and V,;, in the same manner as used in 
Ref. 8, as compounded in parameter X defined as 


1 


> ————E—— Es 
ies Ae V ps 


(31) 


; $ : 
$ i § 


ee: 
‘ 3 j 
Fj £ 
* ny ; | i 
3 Merman nntenpenincsratnnntnatfiensennentemlpte tonto jean anihseethes ltisedpasialnaneans apaaganang 
: ; + i : 
a 3 





Fig. 3—An expanded view of the drain J-V characteristics in the so-called linear 
region near the origin on gate-bias offset mode (+0.4 V in this case). 
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Vds =0.05V 


DRAIN CURRENT I, (mA) 





0.5 0.4 0.3 0.2 0.1 0 -0.1 -0.2 -03 -0O4 -0.5 "0.6 
GATE BIAS VOLTAGE V4, (V) 


Fig. 4—Drain current as a function of gate bias voltage in both forward and reverse 
directions at a drain-source bias voltage, Vu,, of 0.05 V. An appropriate extrapolation of 
this curve to the abscissa gives the external (or terminal) pinch-off voltage, V,. A 
misjudgment in the determination of V,, as shown by the dash-dotted line, will cause a 
problem in the next step. 


This plot should be a straight line. However, the plot may deviate from 
the line either upward or downward as X increases, as indicated by 
dashed lines in Fig. 5. Such a deviation depends upon the temporarily 
determined value of V,. If the plot significantly departed from the 
straight line, the previously determined V, had to be re-examined. 
Usually, a slight adjustment on the V, value easily solves this problem 
and JV, is finally determined. 

Such a way of determining V, seems to be much more complicated 
and tedious than the conventional method. Usually, V, is simply 
estimated from V,, corresponding to the bottom line of the drain J-V 
characteristics. Indeed, the conventional method may not bring too 
much error in the determination of V, in the case of thick active 
channels, i.e., devices with high pinch-off voltages. However, in devices 
with thin active channels, especially on buffer layers, the error may 
reach as high as 100 percent with the conventional method, as will be 
seen in Fig. 6. Therefore, the present method has been developed. 

In Fig. 5, the linear extrapolation of the plot to the ordinate gives 
the value of (R; + Ra). The slope of the line is designated as R,. The 
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parameter R,X represents the effective value of active channel resist- 
ance at a given Vyz.. 

As was mentioned previously, the forward gate current value is 
affected by a combination of series resistances at high current levels. 
Therefore, the slope of the J,-V, characteristic, measured at a current 
density of around 10* A/cm’, gives an estimate of series resistance 
values. By measuring the gate current in three different ground con- 
nections, 1.e., source and drain combined, source only, and drain only, 
three resistance values can be obtained. The differences between the 
last two values yields (R, — Ra). Since (R, + Ra) has been known, R, 
and #4, are now readily separated. The gate series resistance R, can be 


0.356 V 


V, = 0.300 V 
/ 


0.400 V 


DRAIN-TO-SOURCE RESISTANCE IN OHMS 


R, + Ry = 5.46Q 





—1 
X= 1 Ve— Vos 
V Vot Vp 


Fig. 5—Determination of the open channel resistance, R., and parasitic series resist- 
ances, R, and Rag. The proper selection of V, value in Fig. 4 (i.e., V, = 0.356 V in this 
case) is essential for this determination, as illustrated in two wrong cases (V, = 0.300 V 
and 0.400 V). 
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deduced from the first resistance value by subtracting the contribution 
of the paralleled R, and Ra from the resultant. 


3.3 Determination of specific voltages and currents 


Figure 6 is a typical photograph of the drain J-V characteristics for 
negative gate potentials, taken with the nonoffset gate-bias mode of 
the curve tracer. In contrast with Fig. 6, Fig. 7 is an unconventional 
photograph of the drain characteristics of the same device when driven 
in the forward gate bias with the offset mode. As the positive value of 
gate offset is increased, the drain current increases. Beyond a certain 
value of the offset, V;, however, the drain current no longer increases. 
Figure 7 shows such a state of offsetting. 

The knee voltage of a drain IJ-V characteristic could be defined as 
the intercepting point between the extensions of two linear regions of 
the characteristic. The knee voltage of the J-V curve for V; is denoted 
by Viz as shown in Fig. 7. Also, the knee voltage for the zero-gate-bias 
curve is by V;. as shown in Fig. 6. Total drain currents, J; and Ia, for 
V, and null gate bias, respectively, are measured at the corresponding 
knee voltages, Vir and V;.. Leakage current components, Jp; and Ipo, 
are also measured at Viz and Vz, respectively, both for Vz. = —V>. 
The maximum channel current, J,,, and zero-gate-bias channel current, 
I,, are then evaluated as I; — Ip¢ and Lao — Ino, respectively. 





Fig. 6—A conventional drain J-V characteristics with nonoffset gate-bias mode as 
usual. 
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Fig. 7—-A special drain J-V characteristics with a forward gate-bias offsetting of V;. 
This is a critical value beyond which no increase in the drain current is observed. 


3.4 Determination of transconductance 


As is well known, the magnitude of the transconductance of a good 
device can be assumed to remain constant up to nearly the cutoff 
frequency. Therefore, the so-called dc transconductance can be consid- 
ered a first-order approximation of the amplitude of microwave tran- 
sconductance. 

The following method of measuring g;, and evaluating g,, 1s conven- 
tional. In the drain J-V characteristics, an increment of drain current, 
Ala, between two adjacent curves for V,; = Vi and Vz; = V2 at a given 
Vas yields an average transconductance as 
Ala 


; 32 
VV, (32) 


Em = 








However, this value. is a result of degradation due to R,. The intrinsic 
value of the small-signal transconductance at the bias points, Va, = 
Vas and Vz, = (Vi + V2)/2, can thus be derived as 
Em 
2 2 —_——.. 33 
IV. EXAMINATIONS OF NEW TECHNIQUE 
4.1 Sample devices 


To present the practical values of measured dc parameters and 
hence to demonstrate the new technique for determining the basic 
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channel parameters, 11 GaAs MESFETs of various designs were chosen, 
as shown in Table I. The first five devices (A through E) were originally 
designed for high-power use’ and the others (F through K) were for 
low-noise applications.'° 

All devices had a total device width of 0.5 mm, except for device E 
which had 1 mm. The distance between the source and drain electrodes 
was nominally 6 ym for devices A through E and 3 um for devices F 
through K. The nominal gate length was 2.0 to 2.5 xm for devices A to 
D, 1.0 to 1.5 ym for device E, and 0.8 wm for devices F to K. The 
physical length of the gate electrode, however, was not necessarily 
equal to the effective gate length because the latter was subject to the 
shape of a gate junction. 

All the sample devices had a multi-layer structure consisting of an 
undoped n-GaAs film 2 to 3 um thick as the buffer, an n-GaAs channel, 
and n*-GaAs layer, except for devices J and K. All the layers were 
grown sequentially on a semi-insulating GaAs substrate in an AsCl;/ 
Ga/H: cvp system.'’ After removing the n*-GaAs layer and part of n- 
GaAs in the gate region, aluminum approximately 0.7 wm thick was 
deposited as the Schottky-gate metal. The ohmic contacts were formed 
with a 12 percent Ge/Au-Ag-Au system approximately 0.25 um thick, 
alloyed at nearly 500°C. The final metallization was completed with a 
Ti-Pt-Au system 0.9 to 1.4 um thick. 


4.2 Results of measurements 


First, dc parameters n, Vz, Vp, Wp, Vir, Ve, Lp, pp, Im, Who, Ido; Ipo, Lo, 
R., Rs, Ra, Rg and gm were obtained in accordance with the measuring 
technique described in Section III. The measured values of these 
parameters are shown in Table I. Note that g;, is an average value 
taken at approximately J,. 

Second, basic channel parameters L and N were deduced from each 
of the two methods described in Section 2.3. In the application of (18), 
(22), and (23), V. was assumed to be zero for devices A to E and to be 
a single value of 0.17 V for devices F to K. The two deduced values for 
each of L and N were then averaged to obtain the most probable value. 
Using this mean value of N in (15), the most probable value of a was 
determined. All values mentioned here are shown in Table II. 


4.3 Comparison of calculated and measured results 


By inserting the determined values of N, a, and L into (4), (6), and 
(5), Im was calculated. The calculated value of [,, was then compared 
with the measured value as shown in Table III. By adopting the 
measured value of V, and FR, in (10) and (11), respectively, J, was also 
calculated using (9), (4), and (8). The calculated J, was compared with 
the measured value, again as shown in Table III. The comparison has 
shown excellent agreement between the calculated and measured 
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Parameter 
Symbol Units 
Z mm 
z mm 
n 
V, V 
V> V 
W, V 
Var V 
V; V 
I, f mA 
I pf mA 
is mA 
Vro V 
I, do mA 
oe mA 
i. mA 
R, Q 
R, 2 
Ra 2 
Rz 2 
Bm mi 


A 
0.5 


1.09 
0.72 
0.84 
6.56 
1.95 
1.49 
176 


175 
122 


122 
4.1 
4.8 
2.4 
3.8 

28 


Table I—NMeasured values of dc parameters 
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Table Il—Determination of the most probable values of active channel carrier concentration and thickness and 


effective gate length 


Parameter Device 
Equations 
Used Symbol Units A B C D E F G H I J K 
V.. V 0 0 0 0 0 0.17 0.17 0.17 0.17 0.17 0.17 
(18, 19) N. 10'° cm~? 6.01 5.98 11.19 21.16 7.96 3.52 6.34 7.75 9.84 7.59 7.91 
(18, 21) L pm 3.07 2.06 271 2.08 2.04 0.568 0.466 0.537 0.502 0.491 0.486 
(21, 5, 19) N 10'* em? 5.86 6.15 10.58 20.13 7.78 3.51 6.78 7.76 9.83 7.23 8.31 
(23) L ym 2.91 2.22 2.47 1.97 1.92 0.563 0.545 0.532 0.503 0.436 0.538 
(averaged) N 10'* cm 5.94 6.07 10.89 20.65 7.87 3.52 6.56 7.76 9.84 7.Al 8.11 
(averaged) L um 2.99 2.14 2.59 2.03 1.98 0.566 0.508 0.534 0.502 0.463 0.512 
(15) a pm 0.391 0.394 0.233 0.114 0.314 0.273 0165 0.140 0.134 0.155 0.138 
Table Ili—Calculated values of the maximum channel current and zero-gate-bias channel current and their 
comparison with measured values 
Parameter Device 
Equations 
used Symbol Units A B C D E F G H I J K 
(4, 5, 6) In mA 175 195 177 134 395 = 79.0 84.4 80.6 103 91.0 84.4 
(4, 8, 9, 10, 11) I, mA 118 133 99.0 36.2 272 =: 31.0 18.5 11.0 23,1 18.4 12.1 
Im (cal)/Im (meas) 1.000 0.999 1.000 1.000 1.000 1.000 0.999 1.001 1.000 1.000 0.999 
I, (cal)/I, (meas) 0.965 1.002 0.971 0.953 1.069 1.058 0.973 0.916 1.076 1.054 0.966 


values in both cases. Note that the equations applied to calculate J[,, 
and J, had not been used to determine L, a, and N in Section 4.2. 

The 11 devices used in this experiment were from 11 different slices. 
The free carrier concentration of each slice was evaluated by a doping 
profiler.’ It has been recognized that this particular profiler gives an 
N-value 5 to 40 percent higher than the true value of N."° Also, a 
standard deviation of + 3 percent in the doping across a wafer has 
been known for these slices.'' Under such circumstances, the evaluated 
value of N for each device was compared with an uncorrected, repre- 
sentative N value obtained for the corresponding slice. The results 
shown in Table IV seem to be very reasonable. A consistent pattern of 
difference between the two N values is seen there, as expected from 
the above observation. 


V. ANALYSES OF EXPERIMENTAL RESULTS 
5.1 Schottky-barrier built-in voltage 


The built-in voltage of an aluminum Schottky-barrier gate junction 
at room temperature has been expressed in an analytical form as a 
function of N in n-GaAs’ as follows: 


V, = 0.706 + 0.06 log N_ (V), (34) 


where N is in units of 10'° cm~*. This expression is compared with the 
measured value of V, for all devices used, as shown in Fig. 8. This 
comparison indicates that (34) can be used for aluminum gates on 
n-GaAs at room temperature. 


5.2 Transconductance 


The small-signal transconductance of a GaAs MESFET has been 
described by the theoretical expression” 


I, 


I 3 
2w,| 1-2 


s 


8m = (35) 


where J, is the de drain bias current. Since g;, was measured approx- 
imately at J,, the theoretical value of g,, was also calculated for [4 = 
I, in (85). This calculated value was then compared with the measured 
value obtained using (33). This comparison led to a conclusion that 
the measured value of g,, would agree well with the predicted value if 
the latter were calculated by 


0.9 I; 


Em ~ (36) 


$ 


Ie] 
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Table IV—Comparison of the most probable value of free carrier concentration in the active channel of an individual 


device with an uncorrected value of doping in the epitaxial layer of the corresponding slice 


Parameter Device 
Symbol Units A B C D E F G H I J 
N (slice) 10°° em? 6.7 7.4 14,2 24.6 8.8 4.1 7.1 9.0 10.4 8.1 
uncorrected 
N (slice) /N (device) 1.13 1:22 1.30 1.20 1.12 1.16 1.08 1.16 1.06 1.09 


uncor. most prob. 


11.2 
1.38 


0.84 


ALUMINUM GATE AT 
ROOM TEMPERATURE 


0.80 


THEORETICAL 


0.76 


0.72 


SCHOTTKY—BARRIER BUILT-IN VOLTAGE Vp (V) 





0.68 
2 3 4 5 7 10 20 30 8640 


FREE CARRIER CONCENTRATION N (10!6 cm-3) 


Fig. 8—Comparison of the measured values of gate built-in voltage for aluminum 
Schottky-barrier gates at room temperature, with a theoretical expression as a function 
of free carrier concentration in n-GaAs. 


where I, was the channel current, as shown in Table V. 


5.3 Ohmic contacts and series channel resistance 


The parasitic series resistance, R, or Ra, consists of the ohmic 
contact resistance, R.., and series channel resistance, R.,, between the 
two concerned electrodes. These component resistances are now sep- 
arately expressed. The expression for R.. given by Berger’* and 
Murrmann and Widmann” can be simplified, as already shown by 
Macksey and Adams” as 


1 Rep goo 1 o— 
Reo = V th ee : at 
Z aj a Roa Z a ( 
The series channel resistance can be further divided into two compo- 


nents which represent two individual parts of different structures in 
the space between the gate and one ohmic contacts. 

















Thus, 
Rn = Ro + Rs, (38) 
where 
pele 
Ro= 
2 = Fe, (39) 
and 
p3.L3 
R3 = ; 
3 Was (40) 
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Table V—Measured and predicted values of transconductance at 
zero-gate-bias channel current 


Parameter Device 


&m @ I, A B C D KE F G H I J K 


Measured (m0) 32 36 46 69 83 33 48 58 59 48454 838656 
Predicted (m8) 33.6 35.2 47.4 71.2 81.9 35.1 50.0 554 609 52.1 56.2 


In the above expressions, @j=1,23, Pi=1,2,3 and L;-2,3 are, respectively, the 
thickness, specific resistivity, and length of the GaAs epitaxial film at 
the corresponding place i = 1, 2 or 3, L, is the length of the contacting 
metal electrode, and R, is the specific contact resistance. 

Parameters p and R, are both functions of N. Using the experimental 
data taken by Matino on epitaxial n-GaAs films,'’ the doping depend- 
ence of p can be written in an analytical form 


p = 0.11 N°? (Q-cm) (41) 


in an N range of 107’ to 10° x 10° cm™. Based on the so-called 
Shockley method,® R. was statistically investigated using monitor 
areas provided within the same slices as those fabricated for either 
high-power” or low-noise use.'° An empirical expression for R. was 
then found to be 


R.=4N~°? (107° Q-cm’) (42) 


for N values in the range of 3 to 10° x 10'° cm~°. This expression differs 
from that given by Heime et al.,’* which is 


R.=8N" (10-°Q-cm’). (43) 


However, both expressions give close values of R. for N in the vicinity 
of the mid-10'° cm™ range. As the N value increases, the difference in 
R. between the two expressions becomes recognizable. This would give 
rise to the case of ohmic contacts formed on n*-GaAs layers. An 
estimate by (43) would result in too-optimistic prediction of R.. 

Substitution of (41) and (42) into (37), (39), and (40) yields practical 
expressions for R.o, Re, and Rs as follows: 


2.1 
Ro = Faring (O) ia 
— lids 
haere) mas 
1.1 D3 
~~ —_——. (9). 
and a Za3;N3*" ~) 79) 


For the 11 sample devices with reasonable assumptions on Nj, aj, 
Lez, Neo, a2, Ls,, Ns, and a3, component resistances R.., Re, and Rs 
were calculated using (44), (45), and (46), respectively. The predicted 
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value of R, (or Rv) was thus obtained as the sum of these resistances. 
The average of the measured values of R, and Ry for each device was 
then compared with the predicted value, as shown in Table VI. They 
were In good agreement. 


5.4 Gate series resistance 


Since the input signal applied to the feeding end of the gate travels 
along the gate metallization to the other end, the gate must be 
considered as a distributed network. The effective value of the gate 
metallization resistance in a lumped equivalent circuit 1s, therefore, 
different from the dc value measured from one end to the other. As 
theoretically analyzed by Wolf,’” this effective value, R,, is one-third 
of the end-to-end dc resistance as a first-order approximation. Thus, 





~ Per 
“ 3L,AZ’ we 
where pz is the specific resistivity, L, is the mean length, A 1s the mean 
height, and z is the unit width of the gate metallization. 

By substituting the nominal values of Z, z, L, and h into (47) in 
conjunction with the measured value of R,, p, was evaluated as shown 
in Table VII. The range from 3.8 to 5.7 X 10°° Q-cm gives a mean 
value of 5.0 X 10°° Q-cm. Although this value is higher than a bulk 
aluminum resistivity of 2.8 x 10°° Q-cm, it seems to be very reasonable 
for such a fine, thin, and scaly structure of gate metallization. Based 
on this finding, a practical expression for R, can be given by 


) 


2" 


where Z and z are in units of mm and L, and A are in pm. 

It may be noted that annealing of a device sometimes results in an 
improvement in the effective value of p,. The above measured values 
were obtained without additional heat treatment. 


5.5 Open channel resistance 
The open channel resistance mentioned in Section III can be ex- 
pressed as 
L 


| We eo eg 4 
quoNaZ u) 


where pio is the low-field mobility of electrons.* Thus, po of an individual 
device can be obtained from the measured value of R.,, in conjunction 
with Z, L, a, and N, which have already been determined. 

On the other hand, po is related to p in the form 


1 


Lo = QNb ; (50) 
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Table Vi—Predicted values of parasitic source (or drain) series 
resistance and its component resistances in comparison with measured value 


Parameter Device 
Symbol Units A B C D E F G H I J 
Z mm 0.5 0.5 0.5 0.5 1.0 0.5 0.5 0.5 0.5 0.5 0.5 
N, 10'° cm 100 100 50 100 100 100 100 100 100 7.4 8.1 
ay um 0.15 0.15 0.15 0.15 0.15 0.15 0.15 0.15 0.15 0.35 0.45 
Ly um. 21 2.1 2.1 2.1 2.5 0.85 0.75 0.85 0.85 0.75 0.75 
N2 10'° cm? 5.9 6.1 10.9 20.7 7.9 100 100 100 100 7.4 8.1 
ay um 0.35 0.35 0.2 0.1 0.35 0.1 0.1 0.1 0.1 0.25 0.35 
L; um 0 0 0 0 0 0.4 0.3 0.4 0.4 0.3 0.3 
N3 10'° cm™ — — — — — 3.5 6.6 7.8 9.8 74 8.1 
a3 um a — — — — 0.273 0.165 0.140 0.134 0.155 0.138 
Reo Q 0.52 0.52 0.82 0.52 0.26 0.52 0.52 0.52 0.52 1.89 1.57 
: Q 3.07 3.01 3.26 3.86 1.45 0.68 0.59 0.68 0.68 1.28 0.85 
R3 Q 0 0 0 0 0 1.15 0.86 117 1.00 1.13 0.86 
Predicted Q 3.59 3.53 4.08 4.38 1.71 2.35 1.97 2.37 2.20 4.30 3.28 
Ry, Ra 
Measured Q 3.6 3.65 4.25 4.5 1.75 27 17 2.35 2.3 4.3 3.15 
(R, + Ra)/2 
Table Vil—Determination of the effective resistivity for aluminum gate metallization 
Parameter Device 
Symbol Units A B C D E F G H I J K 
Z mm 0.5 0.5 0.5 0.5 1.0 0.5 0.5 0.5 0.5 0.5 0.5 
2 mm 0.5 0.25 0.25 0.5 0.25 0.25 0.25 0.25 0.25 0.25 0.25 
ty pm 2.5 2.0 2.0 2.0 1.2 0.8 0.8 0.8 0.8 0.8 0.8 
h um 0.7 0.7 0.7 0.7 0.2 0.2 . 0.65 0.65 0.65 0.65 0.65 
Rg Q 3.8 1.7 1.7 4.5 4.4 13:7 3.8 3.7 4.0 4.5 3.8 
De 10-° Q-cm 4.0 57 5.7 3.8 5.1 5.3 4.9 4.8 5.0 5.6 49 


Substituting (41) into (50) yields po as a function of N 
wo = 5.7 N~°'* (10°em"/V-sec). (51) 


The po value deduced from the measured value of AR, and that 
predicted by (51) are shown in Table VIII for all the sample devices. 
The evaluation of the latter value was based on the average value of 
N shown in Table II. In the high-power devices, these two values of 
uo were close enough to confirm Matino’s results'’ and to support the 
use of (41) and (51). In the low-noise devices, however, the to value 
deduced from R, appeared to be approximately one-half the predicted 
value by (51). Such a discrepancy might be caused by a special two- 
dimensional gate-recess structure of these devices, and/or be due to 
an increased influence of the transition layer as the active layer was 
thinned, this transition layer being between the active and buffer 
layers. This is subject to further investigation. 

As shown in Table VIII, the po value deduced from the measured 
value of R, using (49) differs from one device to the other in some 
degree. In accordance with the two-piece approximation of the v-E£ 
characteristic, however, jto is assumed to be constant by definition and 
is equal to v,/E,. Substituting the aforementioned best-fit values of v, 
and E, into this yields a fixed value of 4.8 x 10° cm?/V-sec for io, which 
is independent of N. This po value is much greater than that deduced 
from FR, by (49) and that calculated by (51) as a function of N in the 
normal range. On the contrary, the fixed values of v, and E. have been 
satisfactory to evaluate I,, for a wide range of N, as seen in the previous 
sections. Such a conflicting situation must be reconciled. This is also 
subject to further study. Nevertheless, the following can be applied in 
practice, for the time being. The fixed values of v, and E, are appro- 
priate for the evaluation of J, for all devices. The to value provided by 
(51) is adequate in (49) to calculate R, for devices with plane gates. 
However, a suitable correction factor is necessary for po given by (51) 
to make pio effective for nonplane gate devices. For example, this factor 
was 0.5 for the low-noise devices used as samples. 


Tabie Vill—Comparison between the low-field electron mobility value 
deduced from the measured value of open channel resistance by 
(49) and that predicted from the most probable value of free-carrier 
concentration in the active channel by (51) 


Parameter Device 
wo in 10° cm?/V-sec A B C D E F G H I J K 
Calculated by (49) 39 3.7 44 32 42 18 20 21 #19 19 189 
Calculated by (51) 41 41 37 33 39 45 41 39 38 40 39 
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VI. PREDICTION OF MICROWAVE PERFORMANCE 
6.1 Maximum output power 


As was previously mentioned, devices A through E were originally 
designed for high-power use. The maximum available output power, 
Pmax, of these devices were measured at 4 GHz in a coaxial system. A 
double-slug tuner was provided in each of the input and output circuits 
to obtain the conjugate match. The data taken at a drain bias of 12V 
are shown in Table IX. 

The maximum output power delivered from a GaAs MESFET oper- 
ating at a drain-source voltage of Vu; is approximately given by 

: 

Pra = ra (Vas — Var) (52) 
if Vu, 18 sufficiently lower than the drain-source breakdown voltage. 
By substituting the measured values of J, and Vi; into (52), Pmax was 
calculated for Vz, = 12V. Furthermore, Pmax was predicted using the 
geometrical and material parameters, shown in Tables II and VI, to 
calculate J,, and Vi; Both predicted values of Prax are shown in 
Table IX. 

As seen in Table IX, there is excellent agreement between the 
measured and predicted values of Pma, in devices C, D, and E at this 
drain bias voltage. However, the measured values of devices A and B 
were substantially smaller than the predicted values. This discrepancy 
could be caused by the saturation effect in output power as Vas 
increased. This problem of power saturation will be discussed in a 
separate paper. Without power saturation mechanisms, the maximum 
output power capability at microwave frequencies can be predicted 
by dc parameters as well as by device geometrical and material 
parameters. 


6.2 Minimum noise figure 


Devices F through K were originally designed for low-noise ampli- 
fiers. The minimum noise figure, Finin, of these devices was measured 
at 5.92 GHz in a coaxial system with double-slug tuners. The measured 
values are shown in Table X. 


Table IX—Predicted and directly measured values of the maximum 
output power at 4 GHz and 12V drain bias 


Parameter Device 
Prax @ 12 volts A B C D EB 
Measured directly (W) 0.40 0.40 0.44 0.36 1.00 
Predicted from measured dc para’s (W) 0.439 0491 0.437 0.3850 1.001 
Predicted from geo. and mat. para’s (W) 0.439 0493 0.440 0.3853 1.006 
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Table X—Predicted and directly measured values of the minimum 
noise figure at 5.92 GHz 
Parameter Device 
Fin @ 5.92 GHz F G H I J K 
84 1.74 1.75 1.76 


Measured directly (dB) 2.22 1.51 1 
Predicted from measured dc para’s (dB) 2.21 1.50 1.80 1.73 £1.75 ~~ 1.76 
Predicted from geo. and mat. para’s (dB) 2.12 1.56 1.79 1.70 £1.72 ~~ 1.80 


A simple expression for Pinin has been derived as 
Fin = 10 log [1 + KfL Vgn(Rz + Rs)] (dB), (53) 


where fis the frequency of interest in GHz and K is the fitting factor.” 
This fitting factor, which represents the channel material properties, 
ranges from 0.25 to 0.3 in most cases. Substituting (33) into (53) with 
a typical K-value of 0.27 yields 


Fin © 10 log E + 0.27 fL \Seee| (dB). (64) 


Using the measured values of gm, Rz,, and R;, and deduced value of L 
in (54), Pmin was calculated for devices F to K. The minimum noise 
figure was also predicted from the geometrical and material param- 
eters, using the values shown in Tables II, VI, and VII to calculate g,,, 
R,, and Rg. 

These predicted values are compared with the directly measured 
value in Table X. The agreement is excellent between them. This 
supports the idea that the de characterization of a low-noise GaAs 
MESFET makes it possible to predict Fmin at microwave frequencies 
with a remarkably high accuracy. Also, once the geometrical and 
material parameters are given for a device, its Pinin can be calculated 
as well. It would be worthwhile to note that, if the operating frequency 
approaches the cutoff frequency of a device or frequencies where the 
skin effect on the gate metallization becomes significant, an additional 
term is required in (54) for an improved accuracy.” 


Vil. SUMMARY OF RELATIONSHIPS 


Table XI is a summary of the relationships between the dc and rf 
performance parameters and the geometrical and material parameters 
of a GaAs MESFET. 


Vill. CONCLUSIONS 


This paper complements the recent study of a new model of the 
GaAs MESFET.’ A new technique has been introduced in which the 
basic channel parameters, such as the effective gate length, channel 
doping, and channel thickness, are determined from the so-called dc 
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Table Xi—Summary of relationships 


dc Participating Geometrical 
Parameter Equations and Material Parameters 
Vp (34) N 
W, (3) N, a 
Vp (3, 34) N, a 
I, (4) N, a, Z 
B (6) N, a, L 
di (5, 6, 4) or (7) N, a, L, Z 
R,, Ra (44, 45, 46) N, N’, Q\, A2, Q3, Lz, D3, Z 
Rz (48) Z, 2, h, Les Pg 
I, (8, 9, 10, 11, 4) N, a, L, Z, Rs 
or (8, 12, 13, 3, 4) N, a, Z, Rs, Ve 
Vor (14, 3, 4, 5, 6, 44, 45, 46) N, a, L, Z, Rs, Ra, Ve 
Performance Above Parameters 
Parameter Equations plus Bias Parameter 
Zn (36, 3, 4) N, a, L, Z, I. 
Zin (36, 33, 3, 4, 44, 45, 46) N, a, L, Z, Rs, I. 
Pie (52, 3, 4, 5, 6, N, a, L, Z, Rs, Ra, Vas 
14, 44, 45, 46) 
Frain (54, 36, 3, 4, N, a, L, Rs, Ry, 
44, 45, 46, 48) 


parameters. Also, a precise technique developed for measuring the dc 
parameters was shown. Using 11 sample devices chosen from a wide 
variety of designs, usefulness of the new techniques was demonstrated. 

The determined values of the basic channel parameters for the 
sample devices were used to calculate their dc parameters, such as the 
maximum channel current, zero-gate-bias channel current, and 
transconductance in the simple, analytical expressions recently ob- 
tained.’ Their predicted values were then compared with the measured 
values in excellent agreement for all devices used. Practical expres- 
sions, in terms of device geometrical and material parameters, devel- 
oped for parasitic resistances were verified in good agreement with 
measured values on all sample devices. 

Using the sample devices, it was demonstrated that the maximum 
output power and minimum noise figure at microwave frequencies can 
be predicted by de parameters as well as by device geometrical and 
material parameters through simple, analytical expressions. In other 
words, proper dc characterization of a GaAs MESFET makes it possible 
to predict the microwave power handling capability and minimum 
noise property. 

Finally, the relationships between the dc and rf performance param- 
eters of a GaAs MESFET and its geometrical and material parameters 
were summarized with the relevant equations. This summary would 
be very useful as a handy reference for the design and optimization 
processes of a GaAs MESFET. 
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